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The present invention provides methods for use in identifying, analyzing and typing polymorphic DNA fragments, particularly 
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Polymerases for Analyzing or Typing Polymorphic 
Nucleic Acid Fragments and Uses Thereof 

FIELD OF THE INVENTION 

The present invention is in the field of molecular and cellular biology. The 
invention relates to compositions and methods for use in analyzing and typing 
polymorphic regions of DNA. More particularly, the invention is directed to 
compositions of polymerases (preferably DNA polymerases and most preferably 
thermostableDNApolymerases), and methods usingthese compositions, whereby 
polymorphic, minisatellite, microsatellite or STRDNAfragmentsmay be amplified 
and analyzed. The compositions and methods of the present invention are useful 
in a variety of techniques employing DNA amplification and polymorphism 
analysis, including medical genetic, forensic, and plant breeding applications. 

The present invention also relates to polymerases having reduced, 
substantially reduced or eliminated ability to add one or more non-templated 
nucleotides to the 3* terminus of a synthesized nucleic acid molecule. Preferably, 
the polymerases of the invention are thermostable or mesophilic polymerases. 
Specifically, the polymerases of the present invention (e.g., DNA or RNA 
polymerases) have been mutated or modified to reduce, substantially reduce or 
eliminate such activity (compared to the unmodified, unmutated, or wild type 
polymerase), thereby providing a polymerase which synthesizes nucleic acid 
molecules having little or no non-templated 3* terminal nucleotides. Such 
polymerases thus have enhanced or greater ability to produce a double stranded 
nucleic acid molecule having blunt ended termini which may facilitate cloning of 
such molecules. The present invention also relates to cloning and expression of the 
polymerases of the invention, to nucleic acid molecules containing the cloned 
genes, and to host cells which express said genes. The polymerases of the present 
invention may be used in DNA sequencing, amplification, nucleic acid synthesis, 
and polymorphism analysis. 

The invention also relates to polymerases of the invention which have one 
or more additional mutations or modifications. Such mutations or modifications 
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include those which (1) substantially reduce 3'-+ 5' exonuclease activity; and/or (2) 
substantially reduce 5'-+3 % exonuclease activity. The polymerases of the invention 
can have one or more of these properties. These polymerases may also be used 
in nucleic acid analysis including but not limited to DNA sequencing, 
5 amplification, nucleic acid synthesis, and polymorphism analysis. 

BACKGROUND OF THE INVENTION 

DNA Structure 

The genetic framework (/. e. , the genome) of an organism is encoded in the 
double-stranded sequence of nucleotide bases in the deoxyribonucleic acid (DNA) 
which is contained in the somatic and germ cells of the organism. The genetic 
content of a particular segment of DNA, or gene, is only manifested upon 
production of the protein which the gene ultimately encodes. There are additional 
sequences in the genome that do not encode a protein (i.e., "noncoding" regions) 
which may serve a structural, regulatory, or unknown function. Thus, the genome 
of an organism or cell is the complete collection of protein-encoding genes 
together with intervening noncodingDNA sequences. Importantly, each somatic 
cell of a multicellular organism contains the full complement of genomic DNA of 
the organism, except in cases of focal infections or cancers, where one or more 
xenogeneic DNA sequences may be inserted into the genomic DNA of specific 
cells and not into other, non-infected, cells in the organism. 

Minisatellite and M icrosatellite DNA 

Interspersed throughout the genomic DNA of most eukaryotic organisms 
are short stretches of polymorphic repetitive nucleotide sequences known as 
"minisatellite DNA" sequences or fragments (Jeffreys, A.J., etaL, Nature 314:61- 
25 73 (1985)). These repeating sequences often appear in tandem and in variable 

numbers within the genome, and they are thus sometimes referred to as "short 
tandem repeats" ("STRs") or Variable numbers of tandem repeats" ("VNTRs") 
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(see U.S. Patent No. 5,075,217; Nakamura et al. 9 Science 235:1616-1622 
(1987)). Typically, however, minisatellite repeat units are about 9 to 60 bases in 
length (Nakamura et al. y Science 235: 1616-1622 (1987); Weber and May, Am. J. 
Hum. Genet ¥4:388-396 (1989)) whicharerepeated in tandem about 20-50 times 
(Watson, J.D., et ah, eds., Recombinant DNA, 2nd ed., New York: Scientific 
American Books, p. 146 (1992)). Other short, simple sequences which are 
analogous to minisatellite DNAs, termed "microsatellite DNAs" (Litt, M, and 
Luty, J.A., Am. J. Hum. Genet 4*397-401 (1989); Weber and May, Am. J. Hum. 
Genet. 44:388-396 (1989)), are usually about 1-6 bases in repeat unit length and 
thus give rise to monomelic (Economou, E.T., et al, Proc. Natl. Acad. Sci. USA 
57:2951-2954 (1990)), dimeric, trimeric, quatrameric, pentameric or hexameric 
repeat units (Litt, M., and Luty, J. A., Am. J. Hum. Genet 44:397-401 (1989); 
Weber and May, Am. J. Hum. Genet. 44:388-396 (1989)). The most prevalent 
of these highly polymorphic microsatellite sequences in the human genome is the 
dinucleotide repeat (dC-d A) n *(dG-dT) D (where n is the number of repetitions in 
a given stretch of nucleotides), which is present in a copy number of about 
50,000- 1 00,000 (Tautz, D. , and Renz, M., Nucl. Acids Res. 12 A\ 27-4 138(1 984); 
Dib, C, et al. y Nature 3*0:152-154 (1996)), although the existence of a variety 
of analogous repeat sequences in the genomes of evolutionarily diverse eukaryotes 
has been reported (Hamada, H, et al, Proc. Natl. Acad Sci. USA 79:6465-6469 
(1982)). 

The actual in vivo function of minisatellite and microsatellite sequences is 
unknown. However, because these tandemly repeated sequences are dispersed 
throughout the genome of most eukaryotes, exhibit size polymorphism, and are 
often heterozygous (Weber, J.L., Genomics 7:524-530 (1990)), they have been 
explored as potential genetic markers in assays attempting to distinguish closely 
related individuals, and in forensic and paternity testing (see, e.g., U.S. Patent No. 
5,075,217; Jeffreys, A. J., et al. y Nature 352:278-281 (1988)). The finding that 
mutations often are observed in microsatellite DNA regions in cancer cells (Loeb, 
L.A, Cancer Res. 54:5059-5063 (1994)), potentially linking genomic instability 
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to the carcinogenic process and providing useful genetic markers of cancer, lends 
additional significance to methods facilitating the rapid analysis and genotyping of 
polymorphisms in these genomic DNA regions. 

Methods of Genotyping Minisatellite or STR DNA Sequences 

To analyze minisatellite, microsatellite or STR DNA sequence 
polymorphisms, a variety of molecular biological techniques have been employed. 
These techniques include restriction fragment length polymorphism (RFLP) or 
"DNA fingerprinting" analysis (Wong, Z., et al., Nucl Acids Res. 74:4605-4616 
(1986); Wong, Z., et al.Ann. Hum. Genet 57:269-288 (1987); Jeffreys, A.J., et 
al 9 Nature 532:278-281 (1988); U.S. Patent Nos. 5,175,082; 5,413,908; 
5,459,039; and 5,556,955). Far more commonly employed for STR genotyping 
than RFLP and hybridization, however, are amplification-based methods, such as 
those relying on the polymerase chain reaction (PCR) method invented by Mullis 
and colleagues (see U.S. Patent Nos. 4,683,195; 4,683,202; and 4,800,159). 
These methods use "primer" sequences which are complementary to opposing 
regions flanking the polymorphic DNA sequence to be amplified from the sample 
of genomic DNA to be analyzed. These primers are added to the DNA target 
sample, along with excess deoxynucleotides and a DNA polymerase (e.g., Tag 
polymerase; see below), and the primers bind to their target via base-specific 
binding interactions (i.e., adenine binds to thymine, cytosine to guanine). By 
repeatedly passing the reaction mixture through cycles of increasing and 
decreasing temperatures (to allow dissociation of the two DNA strands on the 
target sequence, synthesis of complementary copies of each strand by the 
polymerase, and re-annealing of the new complementary strands), the copy 
number of the minisatellite or STR sequence of DNA may be rapidly increased, 
and detected by size separation methods such as gel electrophoresis. 

PCR and related amplification approaches have been used in attempts to 
develop methods for typing and analyzing STRs or minisatellite regions. For 
example, PCR has been employed to analyze polymorphisms in microsatellite 
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sequences from different individuals, including (dC-dA)n(dG-dT)n (Weber, J.L, 
and May, P.E., Am. J. Hum. Genet 4*388-396 (1989); Weber, J. L., Genomics 
7:524-530 (1990); U.S. Patent Nos. 5,075,217; 5,369,004; and 5,468,613). 
Similar methods have been applied to a variety of medical and forensic samples to 
perform DNA typing and to detect polymorphisms between individual samples 
(U.S. Patent Nos. 5,306,616; 5,364,759; 5,378,602; and 5,468,610). 



In Vitro Use of DNA Polymerases 

The above-described amplification-based techniques require the use of 
DNA polymerases, which catalyze the addition of deoxynucieoside triphosphate 

10 (dNTP) bases into the newly forming DNA strands. Together with other enzymes 

(e.g., helicases, ligases and ATPases), the DNA polymerases ensure rapid and 
relatively faithful replication of DNA in preparation for proliferation in vivo in 
prokaryotes, eukaiyotes and viruses. 

DNA polymerases synthesize the formation of DNA molecules which are 

15 complementary to a DNA template. Upon hybridization of a primer to the single- 

stranded DNA template, polymerases synthesize DNA in the 5' to 3 1 direction, 
successively adding nucleotides to the 3 , -hydroxyl group of the growing strand. 
Thus, in the presence of deoxyribonucleoside triphosphates (dNTPs) and a primer, 
a new DNA molecule, complementary to the single stranded DNA template, can 

20 be synthesized. 

In addition to an activity which adds dNTPs to DNA in the 5' to 3' 
direction (i.e., "polymerase" activity), many DNA polymerases also possess 
activities which remove dNTPs in the 5' to 3 1 and/or the 3 1 to 5 f direction (i.e., 
"exonuclease" activity). This dual activity of certain DNA polymerases is, 

25 however, a drawback for some in vitro applications. For example, the in vitro 

synthesis of an intact copy of a DNA fragment by the polymerase activity, an 
elongation process which proceeds in a 5 1 to 3' direction along the template DNA 
strand, is jeopardized by the exonuclease activities which may simultaneously or 
subsequently degrade the newly formed DNA. 
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Limitations of PCR-based Gcnotyping of Minisatellite, Microsatellite and 
STR DNA Sequences 

Application of PCR-based methods to analysis of minisatellite or STR 
DNA sequences has a number of significant limitations. It has been shown, for 
example, that use of Taq and other thermostable DNA polymerases commonly 
employed in PCR and related automated amplification methods causes the 
accumulation of amplification products containing non-templated 3' terminal 
nucleotides (Clark, J.M, etal, J. Molec. Biol 795:123-127 (1987); Clark, J.M., 
Nucl. Acids Res. 76:9677-9686 (1988); Hu, G., DNA Cell Biol. 72:763-770 
(1993)). That is, some of the newly synthesized DNA strands produced in each 
round of amplification have had an extra nucleotide added to their 3 1 termini, such 
that the newly synthesized strands may be longer by one base. 

Non-templated nucleotide addition is a slow process compared to 
template-directed synthesis (Clark, JM, Nucl Acids Res. 76:9677-9686 (1988)), 
and its extent is sequence-dependent (Hu, G.,DNA CellBiol 72:763-770(1993); 
Brownstein, M.J., et at, BioTechniques 20:1004-1010 (1996)). Consequently, 
the PCR product is often heterogeneous in regard to extra nucleotide addition 
depending upon the primers and the reaction conditions used by the investigator 
(Magnuson, V.L., etal, BioTechniques 27:700-709 (1996)). Extra nucleotide 
addition, in combination with "stutter" due to slippage during PCR amplification 
(Levinson, G., and Gutman, G.A., Molec. Biol Evol 4:203-221 (1987); 
Schlotterer, C, and Tautz, D., Nucl AcidsRes. 20:21 1-215 (1992)), often results 
in complex DNA fragment patterns which are difficult to interpret, especially by 
automated methods. This can result in improper genotyping analysis, particularly 
if the percentage of non-templated nucleotide addition is between 30-70% of the 
PCR product (Smith, J.R., etal., Genome Res. 5:312-317 (1995)). 

Thus, a need currently exists for a rapid, automated method for identifying, 
analyzing and typing polymorphic DNA fragments, particularly minisatellite, 
microsatellite or STR DNA fragments, that will not result in the problematic 
results described above. The present invention provides such a method. 
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BRIEF SUMMARY OF THE INVENTION 

The present invention satisfies these needs in the art by providing methods 
useful in the identification, analysis or typing of polymorphic DNA fragments, 
particularly minisatellite, microsatellite or STR DNA fragments, in samples of 
5 DNA from a cell, particularly a eukaryotic cell. Specifically, the invention 

provides a method of producing a population of amplified DNA molecules, for use 
in analyzing or typing a DNA molecule in a DNA sample isolated from a cell, 
preferably a eukaryotic cell. The method of the present invention comprises 
contacting a DNA sample with a DNA polymerase (preferably a thermostable 
10 DNA polymerases) reduced, substantially reduced or eliminated in the ability to 

add one or more non-templated nucleotides to the 3' terminus of aDNA molecule, 
amplifying a polymorphic DNA fragment, preferably a minisatellite, microsatellite 
or STR DNA fragment, within the DNA sample and analyzing the amplified 
polymorphic DNA fragment. In the method of the invention, the analysis step may 
15 comprise, for example, sizing or sequencing the amplified DNA molecule and 

optionally comparing the size and/or sequence of the amplified DNA molecule to 
a different DNA sample which has been amplified according to the invention. In 
preferred embodiments of the present invention, the thermostable DNA 
polymerase is a Thermotoga DNA polymerase, preferably a Thermotoga DNA 
20 polymerase substantially reduced in 3'-5* exonuclease activity, more preferably a 

Tne polymerase, a Tma polymerase, or a mutant or derivative thereof, and most 
preferably a mutant of Tne polymerase selected from the group consisting of 
Tne NA219, D323A; Tne NA283, D323A; Tne NA284, D323A; Tne N'A193, 
D323A; 7weD137A, D323A; 7weD8A, D323A; Tne G195D, D323A; Tne G37D, 
25 D323A; TneNMM; Tne D137A, D323A, R722K; Tne D137A, D323A, R722Y; 

Tne D137A, D323A, R722L; Tne D137A, D323A, R722H; Tne Dl 37A, D323A, 
R722Q; Tne D137A, D323A, F730Y; Tne D137A, D323A, K726R; Tne D137A, 
D323A, K726H; Tne D137A, D323A, R722K, F730Y; Tne D137A, D323A, 
R722K, K726R; Tne D137A, D323A, R722K, K726H; Tne D137A, D323A, 
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R722H, F730Y; Tne D137A, D323A, R722H, K726R; Tne D137A, D323A, 
R722H, K726H; Tne D137A, D323A, R722Q, F730Y; Tne D137A, D323A, 
R722Q, K726R; Tne D137A, D323A, R722Q, K726H; Tne D137A, D323A, 
R722N, F730Y; Tne D137A, D323A, R722N, K726R; Tne D137A, D323A, 
R722N, K726H; Tne D137A, D323A, F730S; Tne NA283, D323A, 
R722K/H/Q/N/Y/L; 7»eN'A219,D323A,R722K; r«eN'A219,D323A,F730Y; 
7>,eNA219 ) D323A, K726R; JneNA219,D323A, K726H; TneDUlA, D323A, 
F730S, R722K/Y/Q/N/H/L, K726R/H; Tne D137A, D323A, F730T, 
R722K/Y/Q/N/H/L, K726R«; Tne D137A, D323A, F730T; Tne F730S; Tne 
F730A; 7he K726R Tne K726H; and 7«e D137A, D323A, R722N. The present 
invention is particularly directed to the above methods wherein the eukaryotic cell 
is an plant cell or an animal cell, preferably a mammalian cell, more preferably a 
normal, diseased, cancerous, fetal or embryonic mammalian cell, and most 
preferably a human cell. The invention is also directed to the above methods, 
further comprising isolating the polymorphic, minisatellite, microsatellite or STR 
DNA fragment and inserting it into a vector, preferably an expression vector. By 
the present methods, the polymorphic or microsatellite DNA fragment may be 
amplified prior to being inserted into the vector. 

The present invention also provides a method of determining the 
relationship between a first individual and a second individual, comprising 
contacting a DNA sample from the first and second individuals with a DNA 
polymerase {e.g. a thermostable DNA polymerase) reduced, substantially reduced 
or eliminated in the ability to add one or more non-templated nucleotides to the 
3' terminus of a DNA molecule, amplifying one or more DNA molecules in the 
DNA sample to generate a collection of amplified polymorphic DNA fragments, 
separating the amplified DNA fragments by length, and comparing the pattern of 
amplified DNA fragments from the first individual to that of the second individual. 
This method also allows the identification of one or more unique polymorphic 
DNA fragments, particularly a minisatellite, microsatellite or STR DNA fragment, 
that is specifically present in only one of the two individuals. This method may 
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fijrther comprise determining the sequence of the unique polymorphic, 
minisatellite, microsatellite or STR DNA fragment. In this embodiment of the 
present invention, the thermostable DNA polymerase may be a Thermotoga DNA 
polymerase, preferably a Thermotoga DNA polymerase substantially reduced in 
3'-5' exonuclease activity, more preferably a Tne polymerase, a Tma polymerase, 
or a mutant or derivative thereof, and most preferably a mutant of Tne polymerase 
selected from the group consisting of Tne NA219, D323A; Tne NA283, D323A; 
Tne NA284, D323A; Tne NA193, D323A; Tne D137A D323A; Tne D8A, 
D323A; Tne G195D, D323A; Tne G37D, D323A; Tne 1^283; Tne D137A 
D323A R722K; Tne D137A, D323A, R722Y; Tne D137A D323A, R722L; 
7wD137A,D323AR722H; 7>*D137A,D323A R722Q; 7»eD137AD323A, 
F730Y; Tne D137A, D323 A, K726R; Tne D137A D323 A, K726H; Tne D137A 
D323A R722K, F730Y; Tne D137A, D323A, R722K, K726R; Tne D137A 
D323A R722K, K726H; Tne D137A, D323A, R722H, F730Y; Tne D137A, 
D323A R722H, K726R; Tne D137A, D323A, R722H, K726H; Tne D137A, 
D323A R722Q, F730Y; Tne D137A, D323A, R722Q, K726R; Tne D137A, 
D323A R722Q, K726H; Tne D137A, D323A R722N, F730Y; Tne D137A 
D323A R722N, K726R; Tne D137A, D323A, R722N, K726H; Tne D137A, 
D323A F730S; 7>;eN , A283 > D323A R722K/H/Q/N/Y/L; Tne NA219, D323A, 
R722K; 7heNA219,D323A,F730Y; r«gN'A219, D323A.K726R; T«eNA219, 
D323AK726H; r/icD137AD323AF730S,R722K^f/Q/N/H^,K726R/H; Tne 
D137A D323A F730T, KI22JUY/Q/N/WL, K726R/H; Tne D137A, D323A 
F730T; Tne F730S; Tne F730A; Tne K726R; Tne K726H; and Tne D137A, 
D323A R722N. The present invention is particularly directed to the above 
methods wherein the first or second individual is an animal or a plant, and most 
preferably wherein the first or second individual is a human. 

The present invention also provides isolated nucleic acid molecules 
encoding mutant Tne DNA polymerase proteins, wherein the mutant Tne DNA 
polymerase proteins have an amino acid sequence as set forth in any one of SEQ 
ID NOs: 4-10. The invention also provides mutant Tne DNA polymerase proteins 
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having an amino acid sequence as set forth in any one of SEQ ID NOs:4-10, most 

preferably a mutant Tne polymerase protein selected from the group consisting of 

Tne N'A283, D323A (SEQ ID NO:4); Tne NA193, D323A (SEQ ID NO:5); 

Tne D137A, D323A (SEQ ID NO:6); Tne D8A, D323A (SEQ ID NO:7); 
5 Tne G195D, D323A (SEQ ID NO:8); Tne G37D, D323A (SEQ ID NO:9); and 

Tne N'A283 (SEQ ID NO: 10). The invention also relates to nucleic acid 

molecules and the proteins encoded by such nucleic acid molecules for mutant Tne 

polymerases selected from the group consisting of Tne n'A283; Tne D137A, 

D323A, R722K; Tne D137A, D323A, R722Y; Tne D137A, D323A, R722L; 
10 7heD137A, D323A, R722H; 7heD137A, D323A,R722Q; 7>a?D137A, D323A, 

F730Y; Tne D137A, D323A, K726R Tne D137A, D323A, K726H; Tne D137A, 

D323A, R722K, F730Y; Tne D137A, D323A, R722K, K726R; Tne D137A, 

D323A, R722K, K726H; Tne D137A, D323A, R722H, F730Y; Tne D137A, 

D323A, R722H, K726R; Tne D137A, D323A, R722H, K726H; Tne D137A, 
15 D323A, R722Q, F730Y; Tne D137A, D323A, R722Q, K726R; Tne D137A, 

D323A, R722Q, K726H; Tne D137A, D323A, R722N, F730Y; Tne D137A, 

D323A, R722N, K726R; Tne D137A, D323A, R722N, K726H; Tne D137A, 

D323A, F730S; r/ieN'A283, D323A, R722K/H/Q/N/Y/L; 7>ieN'A219, D323A, 

R722K; Tne NA21 9, D323 A, F730Y; 7>ieN , A219,D323A,K726R; 7«eN , A219, 
20 D323A, K726H; Tne Dl 37A, D323A, F730S, R722K/Y/Q/N/H/L, K726R/H; Tne 

D137A, D323A, F730T, R722K7Y/Q/N/H/L, K726R/H; Tne D137A, D323A, 

F730T; Tne F730S; Tne F730A; The K726R; Tne K726H; and Tne D137A, 

D323A, R722N. These mutations may be made to sequence ID NO:2 to produce 

the mutant polymerases having the indicated amino acid mutations (where, for 
25 example, "D137A" indicates that the Asp (D) residue at position 137 in SEQ ID 

NO:2 has been mutated to an Ala (A) residue, and, for example, 

"R722R7Y/Q/N/H/L" indicates that the Arg (R) residue at position 722 in SEQ ID 

NO:2 has been mutated to a Lys (K), Tyr (Y), Gin (Q), Asn (N), His (H) or Leu 

(L) residue). 
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The present invention also provides kits for the identification, analysis or 
typing of a polymorphic DNA fragment, particularly a minisatellite, microsatellite 
or STR DNA fragment, comprising a first container containing one or more DNA 
polymerases reduced, substantially reduced or eliminated in the ability to add non- 
templated 3' terminal nucleotides. Kits according to the invention may contain 
additional containers selected from the group consisting of a container containing 
one or more DNA primer molecules, a container containing one or more 
deoxynucleoside triphosphates needed to synthesize a DNA molecule 
complementary to the DNA template, and a container containing a buffer suitable 
for identifying, analyzing or typing a polymorphic DNA fragment by the methods 
of the invention. Any number of these components of the kit may be combined in 
a single or multiple containers to provide the kit of the invention. According to 
the invention, the DNA polymerase of the kit is preferably a Thermotoga DNA 
polymerase, more preferably a Thermotoga DNA polymerase substantially reduced 
in 3-5' exonuclease activity, still more preferably a Tne polymerase, a Tma 
polymerase, or a mutant or derivative thereof, and most preferably a mutant of 
Tne polymerase selected from the group consisting of Tne N , A283; Tne D137A, 
D323A, R722K; Tne D137A, D323A, R722Y; Tne D137A, D323A, R722L; 
7>ieD137A, D323A, R722H; 7heD137A, D323A, R722Q; Tne D137A, D323A, 
F730Y; Tne D137A, D323A, K726R; Tne D137A, D323A, K726H; Tne D137A, 
D323A, R722K, F730Y; Tne D137A, D323A, R722K, K726R; Tne D137A, 
D323A, R722K, K726H; Tne D137A, D323A, R722H, F730Y; Tne D137A, 
D323A, R722H, K726R; Tne D137A, D323A, R722H, K726H; Tne D137A, 
D323A, R722Q, F730Y; Tne D137A, D323A, R722Q, K726R; Tne D137A, 
D323A, R722Q, K726H; Tne D137A, D323A, R722N, F730Y; Tne D137A, 
D323A, R722N, K726R; Tne D137A, D323A, R722N, K726H; Tne D137A, 
D323A, F730S; Tne N , A283, D323A, R722K/H/Q/N/Y/L; Tne N , A219, D323A, 
R722K; r/?eN , A219,D323A,F730Y; rneN*A219,D323A,K726R; TneWte\% 
D323 A, K726H; TneDUlA, D323A, F730S, R722K/Y/Q/N/H/L, K726R/H; Tne 
D137A, D323A, F730T, R722K/Y/Q/N/H/L, K726R/H; Tne D137A, D323A, 
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F730T; Tne F730S; Tne F730A; Tne K726R; Tne K726H; and Tne D137A, 
D323A, R722N. 

The present invention also relates generally to mutated or modified 
polymerases (DNA or RNA polymerases) which have reduced, substantially 
reduced or eliminated ability to add one or more non-templated nucleotides to the 
3 ' terminus of a synthesized nucleic acid molecule (compared to the corresponding 
wildtype, unmutated or unmodified polymerase). Preferably, such mutant or 
modified polymerases have substantially reduced ability to add one or more non- 
templated nucleotides to the 3' terminus of a synthesized nucleic acid molecule. 
Such polymerases of the invention may be thermostable or mesophilic 
polymerases. Thus, the present invention relates to such mutated or modified 
polymerases and to kits containing such polymerases. The invention also relates 
to the use of such mutant or modified polymerases in a number of procedures 
including DNA sequencing, amplification reactions, nucleic acid synthesis, and 
polymorphism analysis. 

Mutant or modified polymerases of particular interest in the invention 
include Taq DNA polymerase, Tne DNA polymerase, Tma DNA polymerase, Pfu 
DNA polymerase, Tfl DNA polymerase, Tth DNA polymerase, Tbr DNA 
polymerase, /VoDNA polymerase, 2fa/DNA polymerase, Bca DNA polymerase, 
VENT™ DNA polymerase, DEEP VENT™ DNA polymerase, T7 DNA 
polymerase, T5 DNA polymerase, DNA polymerase III, Klenow fragment DNA 
polymerase, StofFel fragment DNA polymerase, and mutants, fragments or 
derivatives thereof. RNA polymerases of interest include T7, SP6, and T3 RNA 
polymerases and mutants, variants and derivatives thereof. 

The present invention relates in particular to mutant Poll type DNA 
polymerases (preferably thermostable DNA polymerases) wherein one or more 
mutations have been made in the O-helix which reduces, substantially reduces or 
eliminates the ability of the enzyme to add one or more non-templated nucleotides 
to the 3' terminus of a synthesized nucleic acid molecule. The O-helix is defined 
as RXXXKXXXFXXXYX (SEQ ID NO: 1 1), wherein X may be any amino acid. 
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The preferred sites for mutation or modification to produce the polymerases of the 
invention are the R and/or F and/or K and/or Y positions in the Ohelix, although 
other changes (or combinations thereof) within the O-helix may be made to make 
the desired polymerase. In this preferred aspect of the invention, R and/or F 
and/or K and/or Y may be replaced with any other amino acid including Ala, Arg, 
Asn, Asp, Cys, Gin, Glu, Gly, His, De, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr 
and Val. 

In accordance with the invention, other functional changes (or 
combinations thereof) may be made to the polymerases having reduced ability to 
add non-templated nucleotides to the 3 f terminus of a synthesized nucleic acid 
molecule. For example, the polymerase may also be modified to reduce, 
substantially reduce or eliminate 5' exonuclease activity, and/or 3' exonuclease 
activity. Thus, the invention relates to mutant or modified DNA polymerases 
having reduced ability to add non-templated nucleotides which are modified in at 
least one way selected from the group consisting of 

(a) to reduce or eliminate the 3'-5' exonuclease activity of the 
polymerase; 

(b) to reduce or eliminate the 5-3' exonuclease activity ofthe 
polymerase; 

Any one or a number of these mutations or modifications (or combinations 
thereof) may be made to provide the polymerases of the invention. Preferred 
polymerases ofthe invention, in addition to having reduced ability to add non- 
templated 3' nucleotides, also have reduced, substantially reduced or eliminated 
3* exonuclease activity. 

The present invention is also directed to nucleic acid molecules (preferably 
vectors) containing a gene encoding the mutant or modified polymerases ofthe 
present invention and to host cells containing such molecules. Any number of 
hosts may be used to express the gene of interest, including prokaryotic and 
eukaryotic cells. Preferably, prokaryotic cells are used to express the polymerases 
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of the invention. The preferred prokaiyotic host according to the present 
invention is E. coli. 

The invention also relates to a method of producing the polymerases of the 
invention, said method comprising: 
5 (a) culturing the host cell comprising a gene encoding a polymerase of 

the invention; 

(b) expressing said gene; and 

(c) isolating said polymerase from said host cell. 

The invention also relates to a method of synthesizing a nucleic acid 
10 molecule comprising: 

(a) mixing one or more nucleic acid templates (e.g. RNA or DNA) 
with one or more polymerases of the invention; and 

(b) incubating said mixture under conditions sufficient to synthesize 
nucleic acid molecules complementary to all or a portion of said templates. Such 

15 condition may include incubation with one or more deoxy- and/or 

dideoxyribonucleoside triphosphates. Such deoxy- and dideoxyribonucleoside 
triphosphates include d ATP, dCTP, dGTP, dTTP, dITP, 7-deaza-dGTP, 7-deaza- 
dATP, dUTP, ddATP, ddCTP, ddGTP, ddlTP, ddTTP, [a-S]dATP, [a-S]dTTP, 
[a-S]dGTP, and [a-S]dCTP. The synthesized nucleic acid molecules may in 

20 accordance with the invention be cloned into one or more vectors. 

The invention also relates to a method of sequencing a DNA molecule, 
comprising: 

(a) hybridizing a primer to a first DNA molecule; 

(b) contacting said molecule of step (a) with deoxyribonucleoside 
25 triphosphates, one or more DNA polymerases of the invention, and one or more 

terminator nucleotides; 

(c) incubating the mixture of step (b) under conditions sufficient to 
synthesize a random population of DNA molecules complementary to said first 
DNA molecule, wherein said synthesized DNA molecules are shorter in length 
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than said first DNA molecule and wherein said synthesized DNA molecules 
comprise a terminator nucleotide at their 3' termini; and 

(d) separating said synthesized DNA molecules by size so that at least a 
part of the nucleotide sequence of said first DNA molecule can be determined. 
5 Such terminator nucleotides include but are not limited to dideoxyribonucleoside 

triphosphates such as ddTTP, ddATP, ddGTP, ddlTP or ddCTP. 

The invention also relates to a method for amplifying a double stranded 
DNA molecule, comprising: 

(a) providing a first and second primer, wherein said first primer is 
10 complementary to a sequence at or near the 3'-termini of the first strand of said 

DNA molecule and said second primer is complementary to a sequence at or near 
the S'-termini of the second strand of said DNA molecule; 

(b) hybridizing said first primer to said first strand and said second 
primer to said second strand in the presence of one or more polymerases of the 

15 invention, under conditions such that a third DNA molecule complementary to 

said first strand and a fourth DNA molecule complementary to said second strand 
are synthesized; 

(c) denaturing said first and third strands, and said second and fourth 
strands; and 

20 (d) repeating steps (a) to (c) one or more times. The amplified double- 

stranded nucleic acid molecules produced by the method of the invention may be 
cloned into one or more vectors. Thus, the invention relates also to a method of 
cloning an amplified DNA molecule comprising: 

(a) amplifying one or more DNA molecules with one or more 
25 polymerases of the invention; and 

(b) ligating said amplified DNA molecules in one or more vectors. 
The invention further relates to a method of cloning a nucleic acid 

molecule comprising: 

(a) mixing a nucleic acid template (or one or more templates) with one 
30 or more polymerases of the invention; 
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(b) incubating said mixture under conditions sufficient to synthesize a 
nucleic acid molecule complementary to all or a portion of said template, thereby 
producing a double-stranded nucleic acid molecule (preferably a double-stranded 
DNA molecule); and 

(c) ligating said double-stranded nucleic acid molecule into one or 

more vectors. 

Preferably, the vectors used for ligating the amplified or synthesized 
double-stranded nucleic acid molecules have blunt ended termini and may be 
prepared by digesting a vector with any one or a number of restriction enzymes 
known in the art which provide blunt end cleavage. Such restriction enzymes 
include Seal, Smal, Hpal, Hindi, HaelU, AM, and the like. 

The invention also relates to kits for sequencing, amplifying, synthesizing 
or cloning of nucleic acid molecules comprising one or more polymerases of the 
invention and one or more other components (or combinations thereof) selected 
from the group consisting of 

(a) one or more dideoxyribonucleoside triphosphates; 

(b) one or more deoxyribonucleoside triphosphates; 

(c) one or more primers; 

(d) one or more suitable buffers; and 

(e) one or more ligases. 

Other preferred embodiments of the present invention will be apparent to 
one of ordinary skill in light of the following drawings and description of the 
invention, and of the claims. 

BRIEF DESCRIPTION OF THE FIGURES 

FIGURE 1 shows the restriction map of the approximate DNA fragment 
which contains the Tne DNA polymerase gene in pSport 1 and pUC19. This 
figure also shows the region containing the O-helix homologous sequences. 
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FIGURE 2A schematically depicts the construction of plasmids pUC-Tne 
(3'^5«)andpUC-TneFY. 

FIGURE 2B schematically depicts the construction of plasmids 
pTrcTne35 and pTrcTne FY. 

5 FIGURE 3 schematically depicts the construction of plasmid 

pTrcTne35 FY. 

FIGURE 4 schematically depicts the construction of plasmids 
pTTQTneSFY and pTTQTne535FY 

FIGURE 5 depicts a plasmid containing the Taq DNA polymerase gene. 

10 FIGURE 6 depicts an autoradiogram showing of the ability of polymerase 

mutants to add non-templated 3' nucleotides. 

FIGURE 7 is an autoradiogram of the product ofPCR amplification of the 
upper and lower alleles of the CD4 locus, using primers corresponding to these 
alleles, demonstrating nontemplated nucleotide addition (n+1) by Taq DNA 
15 polymerase but not by Trie DNA polymerase. 

FIGURE 8 is an autoradiogram of the product ofPCR amplification of the 
upper and lower alleles of the D20S27 locus, using primers corresponding to these 
alleles, demonstrating nontemplated nucleotide addition (n+1) by Taq DNA 
polymerase but not by Tne DNA polymerase. 

20 FIGURE 9 is a composite of electropherogram gel scans of PCR 

amplifications at the DISS 153 (Figures 9A and 9B) and D15S127 loci (Figures 
9C and 9D), demonstrating nontemplated nucleotide addition (n+1 ) by Taq DNA 
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polymerase (Figures 9A and 9C) but not by Tne DNA polymerase (Figures 9B and 
9D). 

FIGURE 10A and B are composites of a electropherogram gel scan of 
PCR amplifications at D16S405 and D16S401 loci. 

5 FIGURE 11 is a composite of a electropherogram gel scan of PCR 

amplifications at D16S401 locus. 

FIGURE 12A and B are composites of a electropherogram gel scan of 
PCR amplifications at D15S127 and D15S153 loci. 

FIGURE 13 is a composite of a electropherogram gel scan of PCR 
10 amplifications at D16S401 locus. 



DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

In the description that follows, a number of terms used in recombinant 
DNA technology are extensively utilized. In order to provide a clearer and 
15 consistent understanding of the specification and claims, including the scope to be 

given such terms, the following definitions are provided. 

Polymorphic. As is understood by one of ordinary skill in the art, a 
nucleic acid molecule is said to be "polymorphic" if it may exist in more than one 
form. For example, a nucleic acid molecule is said to be polymorphic if it may 
20 have more than one specific nucleotide sequence (such as degenerate nucleic acid 

molecules or genes that may each encode the same protein). More commonly, a 
nucleic acid molecule is said to be polymorphic if it displays size differences (i.e., 
differences in length), particularly when comparisons of nucleic acid molecules 
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from different individuals are made. Of course, other definitions of the term 
"polymorphic" will be apparent to one of ordinary skill and are also encompassed 
within this definition. 

Cloning vector. A plasihid, cosmid or phage DNA or other DNA 
molecule which is able to replicate autonomously in a host cell, and which is 
characterized by one or a small number of restriction endonuclease recognition 
sites at which such DNA sequences may be cut in a determinable fashion without 
loss of an essential biological function of the vector, and into which DNA may be 
spliced in order to bring about its replication and cloning. The cloning vector may 
further contain a marker suitable for use in the identification of cells transformed 
with the cloning vector. Markers, for example, are tetracycline resistance or 
ampicillin resistance. 

Recombinant host. Any prokaryotic or eukaryotic microorganism which 
contains the desired cloned genes in an expression vector, cloning vector or any 
DNA molecule. The term "recombinant host" is also meant to include those host 
cells which have been genetically engineered to contain the desired gene on the 
host chromosome or genome. 

Host. Any prokaryotic or eukaryotic microorganism that is the recipient 
of a replicable expression vector, cloning vector or any DNA molecule. The DNA 
molecule may contain, but is not limited to, a structural gene, a promoter and/or 
an origin of replication. 

Promoter. A DNA sequence generally described as the 5 1 region of a 
gene, located proximal to the start codon. At the promoter region, transcription 
of an adjacent gene(s) is initiated. 

Gene. A DNA sequence that contains information necessary for 
expression of a polypeptide or protein. It includes the promoter and the structural 
gene as well as other sequences involved in expression of the protein. 

Structural gene. A DNA sequence that is transcribed into messenger 
RNA that is then translated into a sequence of amino acids characteristic of a 
specific polypeptide. 
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Operably linked. As used herein "operably linked" means that the 
promoter is positioned to control the initiation of expression of the polypeptide 
encoded by the structural gene. 

Expression. Expression is the process by which a gene produces a 
polypeptide. It includes transcription of the gene into messenger RNA (mKNA) 
and the translation of such mRNA into polypeptide(s). 

Substantially Pure. As used herein "substantially pure" means that the 
desired purified protein is essentially free from contaminating cellular contaminants 
which are associated with the desired protein in nature. Contaminating cellular 
components may include, but are not limited to, phosphatases, exonucleases, 
endonucleases or undesirable DNA polymerase enzymes. 

Primer. As used herein "primer" refers to a single-stranded 
oligonucleotide that is extended by covalent bonding of nucleotide monomers 
during amplification or polymerization of a nucleic acid molecule. Minisatellite 
primers used for the amplification of minisatellite dimer, trimer, tetramer, etc., 
sequences are well-known in the art. 

Template. The term "template" as used herein refers to a double-stranded 
or single-stranded nucleic acid molecule which is to be amplified, synthesized or 
sequenced. In the case of a double-stranded DNA molecule, denaturation of its 
strands to form a first and a second strand is performed before these molecules 
may be amplified, synthesized or sequenced. A primer, complementary to a 
portion of a template is hybridized under appropriate conditions and the 
polymerase of the invention may then synthesize a molecule complementary to 
said template or a portion thereof. The newly synthesized molecule, according to 
the invention, may be equal or shorter in length than the original template. 
Mismatch incorporation or strand slippage during the synthesis or extension of the 
newly synthesized molecule may result in one or a number of mismatched base 
pairs. Thus, the synthesized molecule need not be exactly complementary to the 
template. 
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Incorporating. The term "incorporating 11 as used herein means becoming 
a part of a nucleic acid (e.g., DNA) molecule or primer. 

Amplification. As used herein "amplification" refers to any in vitro 
method for increasing the number of copies of a nucleotide sequence with the use 
5 of a DNA polymerase. Nucleic acid amplification results in the incorporation of 

nucleotides into a DNA molecule or primer thereby forming a new DNA molecule 
complementary to a DNA template. The formed DNA molecule and its template 
can be used as templates to synthesize additional DNA molecules. As used herein, 
one amplification reaction may consist of many rounds of DNA replication. DNA 
10 amplification reactions include, for example, polymerase chain reactions (PCR). 

One PCR reaction may consist of 5 to 100 "cycles" of denaturation and synthesis 
of a DNA molecule. 

Oligonucleotide. "Oligonucleotide" refers to a synthetic or natural 
molecule comprising a covalently linked sequence of nucleotides which are joined 
15 by a phosphodiester bond between the 3' position of the pentose of one nucleotide 

and the 5' position of the pentose of the adjacent nucleotide. 

Nucleotide. As used herein "nucleotide" refers to a base-sugar-phosphate 
combination. Nucleotides are monomelic units of a nucleic acid sequence (DNA 
and RNA). The term nucleotide includes deoxyribonucleoside triphosphates such 
20 as dATP, dCTP, dITP, dUTP, dGTP, dTTP, or derivatives thereof. Such 

derivatives include, for example, [aS]dATP, 7-deaza-dGTP and 7-deaza-dATP. 
The term nucleotide as used herein also refers to dideoxyribonucleoside 
triphosphates (ddNTPs) and their derivatives. Illustrated examples of 
dideoxyribonucleoside triphosphates include, but are not limited to, ddATP, 
25 ddCTP, ddGTP, ddlTP, and ddTTP. According to the present invention, a 

"nucleotide" may be unlabeled or detectably labeled by well known techniques. 
Detectable labels include, for example, radioactive isotopes, fluorescent labels, 
chemiluminescent labels, bioluminescent labels and enzyme labels. 

Thermostable. As used herein "thermostable" refers to a polymerase 
30 which is resistant to inactivation by heat. DNA polymerases synthesize the 
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formation of a DNA molecule complementary to a single-stranded DNA template 
by extending a primer in the 5-to-3' direction. This activity for mesophilic DNA 
polymerases may be inactivated by heat treatment. For example, T5 DNA 
polymerase activity is totally inactivated by exposing the enzyme to a temperature 
5 of 90°C for 30 seconds. As used herein, a thermostable polymerase activity is 

more resistant to heat inactivation than a mesophilic polymerase. However, a 
thermostable polymerase does not mean to refer to an enzyme which is totally 
resistant to heat inactivation and thus heat treatment may reduce the polymerase 
activity to some extent. A thermostable polymerase typically will also have a 
10 higher optimum temperature than mesophilic polymerases. 

Hybridization. The terms "hybridization" and "hybridizing" refers to the 
pairing of two complementary single-stranded nucleic acid molecules (RN A and/or 
DNA) to give a double-stranded molecule. As used herein, two nucleic acid 
molecules may be hybridized, although the base pairing is not completely 
15 complementary. Accordingly, mismatched bases do not prevent hybridization of 

two nucleic acid molecules provided that appropriate conditions, well known in 
the art, are used. In the present invention, the term "hybridization" refers 
particularly to hybridization of an oligonucleotide to a template molecule. 

3'-5' Exonuclease Activity. "T-S* exonuclease activity" is an enzymatic 
20 activity well known to the art. This activity is often associated with DNA 

polymerases, and is thought to be involved in a DNA replication "editing" or 
correction mechanism. 

A "DNA polymerase substantially reduced in 3*-5' exonuclease activity" 
(which may also be represented as "3'exo-") is defined herein as either (1) a 
25 mutated DNA polymerase that has about or less than 1 0%, or preferably about or 

less than 1%, of the 3'-5* exonuclease activity of the corresponding unmutated, 
wildtype enzyme, or (2) a DNA polymerase having a 3'-5' exonuclease specific 
activity which is less than about 1 unit/mg protein, or preferably about or less than 
0. 1 units/mg protein. A unit of activity of 3-5' exonuclease is defined as the 
30 amount of activity that solubilizes 10 nmoles of substrate ends in 60 min. at 37°C, 
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assayed as described in the "BRL 1989 Catalogue & Reference Guide", page 5, 
with Hhal fragments of lambda DNA 3'-end labeled with [>H]dTTP by terminal 
deoxynucleotidyl transferase (TdT). Protein is measured by the method of 
Bradford, Anal. Biochem. 72:248 (1976). As a means of comparison, natural, 
wildtype T5-DNA polymerase (DNAP) or T5-DNAP encoded by pTTQ19-T5-2 
has a specific activity of about 10 units/mg protein while the DNA polymerase 
encoded by pTTQ19-T5-2(exo) (U.S. Patent No. 5,270,179) has a specific 
activity of about 0.0001 units/mg protein, or 0.001% of the specific activity of the 
unmodified enzyme, a 10 5 -fold reduction. 

S f -3* Exonuclease Activity. "5-3 ! exonuclease activity" is also an 
enzymatic activity well known in the art. This activity is often associated with 
DNA polymerases, such as E. coli Poll and PolDI. 

A "DNA polymerase substantially reduced in 5-3' exonuclease activity" 
(which may also be represented as "5'exo-") is defined herein as either (1) a 
mutated DNA polymerase that has about or less than 10%, or preferably about or 
less than 1%, of the 5-3' exonuclease activity of the corresponding unmutated, 
wildtype enzyme, or (2) a DNA polymerase having 5-3' exonuclease specific 
activity which is less than about 1 unit/mg protein, or preferably about or less than 
0.1 units/mg protein. 

Both of the 3-5 1 and 5-3* exonuclease activities can be observed on 
sequencing gels. Active 5-3 1 exonuclease activity will produce nonspecific ladders 
in a sequencing gel by removing nucleotides from the 5-end of the growing 
primers. 3-5 1 exonuclease activity can be measured by following the degradation 
of radiolabeled primers in a sequencing gel. Thus, the relative amounts of these 
activities, e.g. by comparing wildtype and mutant polymerases, can be determined 
with no more than routine experimentation. 

Minisatellite DNA. As used herein, the term "minisatellite DNA" refers 
to a DNA fragment comprising a short stretch of tandemly repetitive nucleotide 
sequence. In vivo, minisatellite DNA fragments are found interspersed throughout 
the genomes of most eukaiyotic organisms thus far examined. These repeating 
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sequences appear in tandem and often in variable numbers within the genome; 
thus, the terms "short tandem repeats" ("STRs") or "variable numbers of tandem 
repeats" (" VNTRs") may be used synonymously when referring to these regions. 
Minisatellite DNA fragments are typically about 9 bases to about 60 bases in 
5 length and are repeated about 20-50 times at a typical locus in a eukaryotic 

genome. 

Microsatellite DNA. As used herein, the term "microsatellite DNA" 
refers to DNA fragments which are typically of a repeat unit size of about 1-6 
bases in length. The most prevalent of these microsatellite DNA fragments in the 

10 human genome is the dinucleotide repeat (dC-dA) n «(dG-dT) n (where n is the 

number of repetitions in a given stretch of nucleotides). The terms "STRs" and 
"VNTRs" may also be used synonymously to denote these structures. 

Non-templated 3 f Terminal Nucleotide Addition. As used herein, the 
term "non-templated 3' terminal nucleotide addition" or "extranucleotide addition" 

15 means the propensity of an enzyme such as a DNA polymerase to incorporate one 

or more additional nucleotides, which are not found in the template strand at the 
3' terminus of a newly synthesized nucleic acid molecule in a synthesis or 
amplification reaction, such as PCR. As a result of non-templated 3' terminal 
nucleotide addition, the synthesized or amplification products {i.e., the newly 

20 synthesized DNA strand) will be longer by one or more nucleotides than is the 

template, in such a fashion that if the template is "n" nucleotides in length, the 
synthesis or amplification products will be "n+l," "n+2," "n+3," etc., nucleotides 
in length. A "polymerase substantially reduced in the ability to add one or more 
non-templated nucleotides to the 3 1 terminus of a nucleic acid molecule" is defined 

25 herein as a DNA polymerase, which when it has no 3* exonuclease activity or has 

substantially reduced 3' exonuclease activity, it will produce a collection of 
amplification products in which less than about 50%, preferably less than about 
30%, more preferably less than about 20%, still more preferably less than about 
1 0%, still more preferably less than about 5%, and most preferably less than about 

30 1 % of the amplification products contain one or more non-templated nucleotides 
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at their 3' termini compared to amplification products produced by Taq DNA 
polymerase assayed under the same conditions. Preferably, the conditions used 
for assaying 3' non-templated nucleotide addition is performed such that less than 
100% of the amplification products of Taq DNA polymerase exhibits 3' non- 

5 templated nucleotide addition. Included in this definition are those polymerases 

that satisfy this definition for any primer set used. Thus, if the use of any primer 
set provides the indicated reduction of 3' non-templated nucleotide addition, the 
polymerase is said to be substantially reduced in the ability to add one or more 
non-templated nucleotides to the 3' terminus of a nucleic acid molecule. 

10 When referring to polymerases which have been mutated or modified to 

reduce or eliminate 3' non-templated nucleotide addition, the mutated or modified 
polymerase is said to be "reduced in the ability to add one or more non-templated 
nucleotides to the 3' terminus of a nucleic acid molecule" when the polymerase has 
a lower or reduced or eliminated ability to add non-templated 3 f nucleotides 

15 compared to the corresponding unmutated, unmodified or wildtype polymerase. 

For example, when testing the affect of a point mutation in the O-helix of a 
polymerase on non-templated nucleotide addition, the polymerase unmodified in 
the same position of the O-helix is preferably used for comparison purposes. Such 
mutated or modified polymerases are said to "substantially reduced in the ability 

20 to add one or more non-templated nucleotides to the 3' terminus of a nucleic acid 

molecule" if the mutated or modified polymerase has less than about 50%, 
preferably less than about 30%, more preferably less than about 20%, still more 
preferably less than about 1 0%, still more preferably less than about 5%, and most 
preferably less than about 1% of the activity for adding non-templated 3* terminal 

25 nucleotides compared to the corresponding unmutated, unmodified or wildtype 

polymerase. Preferably, the conditions used for assaying 3' non-templated 
nucleotide addition is performed such that less than 100% of the amplification 
products produced by the unmutated, unmodified or wildtype polymerase control 
exhibits 3' non-templated nucleotide addition. Included in this definition are those 
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mutant or modified polymerases that satisfy this definition for any primer set 
tested. 

The ability of a polymerase to add a non-templated 3 ! terminal nucleotide 
to the growing strand may be assessed by a variety of techniques, most preferably 
by gel electrophoresis of the synthesized or amplification products for a direct size 
comparison and by comparison to markers of known size (see Figures 6-13). 

Other terms used in the fields of recombinant DNA technology and 
molecular and cell biology as used herein will be generally understood by one of 
ordinary skill in the applicable arts. 

Sources of Polymerases 

The methods of the present invention rely on the use of polymerases 
(thermostable or mesophilic DNA or KNA polymerases) reduced, substantially 
reduced or eliminated in the ability to add one or more non-templated 3* terminal 
nucleotide to a growing nucleic acid strand. These thermostable DNA 
polymerases may be obtained from any strain of any thermophilic microorganism, 
including but not limited to strains of Thermus aquaticus (Taq polymerase; see 
U.S. Patent Nos. 4,889,818 and 4,965,188), Thermus thermophilus (Tth 
polymerase), Thermococcus litoralis (77/ or VENT™ polymerase), Pyrococcus 
fiiriosus (Pfu or DEEPVENT™ polymerase), Pyrococcus woosii (Pwo 
polymerase) and other Pyrococcus species, Bacillus sterothermophilus (Bst 
polymerase^, Sulfolobus acidocaldarius (Sac polymerase), Thermoplasma 
acidophilum (Tac polymerase), Bacillus caldophilus (Bca polymerase^, Thermus 
flaws (Tfl/Tub polymerase), Thermus ruber (Tru polymerase), Thermus 
brockianus (DYNAZYME™ polymerase), Thermotoga neapolitana (Tne 
polymerase; see WO 96/10640 and WO96/41014), Thermotoga maritima (Tma 
polymerase; see U. S. Patent No. 5,374,553) and other species of the Thermotoga 
genus (7^ polymerase) and Methanobacterium thermoautotrophicum (Mth 
polymerase). Mesophilic DNA polymerases of interest in the invention include but 
are not limited to T7 DNA polymerases, T5 DNA polymerase, DNA polymerase 
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III, Klenow fragment DNA polymerase and mutants, fragments or derivatives 
thereof. RNA polymerases such as T3, T5, SP6 and mutants, variants and 
derivatives thereof may also be used in accordance with the invention. 
Polymerases having reduced or substantially reduced ability to add a non- 

5 templated 3' nucleotide to a growing nucleic acid strand may be wildtype 

polymerases, or may be made by mutating such wildtype polymerases by standard 
techniques (for example, by generating point mutations, insertions, deletions, etc., 
in the wildtype gene or protein). Polymerases that are reduced or substantially 
reduced in the ability to add a non-templated 3' nucleotide to a growing strand 

10 may be identified by assaying the synthesized products (e.g. PCR products) 

formed by such enzymes, as is well-known in the art and as generally described 
below in the Examples. 

The nucleic acid polymerases used in the present invention may be 
mesophilic or thermophilic, and are preferably thermophilic. Preferred mesophilic 

15 DNA polymerases include T7 DNA polymerase, T5 DNA polymerase, Klenow 

fragment DNA polymerase, DNA polymerase III and the like. Preferred 
thermostable DNA polymerases that may be used in the methods of the invention 
include Taq, Tne, Tma, Pfu t Tfl, Tth, StofFel fragment, VENT™ and 
DEEPVENT™ DNA polymerases, and mutants, variants and derivatives thereof 

20 (U.S. Patent No. 5,436,149; U.S. Patent 4,889,818; U.S. Patent 4,965,188; U.S. 

Patent 5,079,352; U.S. Patent 5,614,365; U.S. Patent 5,374,553; U.S. Patent 
5,270,179; U.S. Patent 5,047,342; U.S. Patent No. 5,512,462; WO 92/06188; 
WO 92/06200; WO 96/10640; Barnes, W.M., Gene 772:29-35 (1992); Lawyer, 
F.C., etaL, PCRMeth. AppL 2:275-287 (1993); Flaman, J.-M, etaL, Nucl Acids 

25 Res. 22(1 5):3259-3260 ( 1 994)). For amplification of long nucleic acid molecules 

(e.g., nucleic acid molecules longer than about 3-5 Kb in length), at least two 
DNA polymerases (one substantially lacking 3' exonuclease activity and the other 
having 3' exonuclease activity) are typically used. See U.S. Patent No. 5,436,149; 
U.S. PatentNo. 5,5 12,462; Fames, W.M., Gene 112:29-35(1992); and copending 

30 U.S. Patent Application No. 08/689,8 14, filed February 14, 1997, the disclosures 
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of which are incorporated herein in their entireties. Examples of DNA 
polymerases substantially lacking in 3' exonuclease activity include, but are not 
limited to, Taq, 7>ie(exo*), 7ma(exo), Pfu (exo), Pwo(exo) and Tth DNA 
polymerases, and mutants, variants and derivatives thereof. 

Polypeptides having nucleic acid polymerase activity are preferably used 
in the present methods at a final concentration in solution of about 0.1-200 units 
per milliliter, about 0.1-50 units per milliliter, about 0.1-40 units per milliliter, 
about 0.1-3.6 units per milliliter, about 0.1-34 units per milliliter, about 0.1-32 
units per milliliter, about 0.1-30 units per milliliter, or about 0.1-20 units per 
milliliter, and most preferably at a concentration of about 20-40 units per milliliter. 
Of course, other suitable concentrations of nucleic acid polymerases suitable for 
use in the invention will be apparent to one or ordinary skill in the art. 

In a preferred aspect of the invention, polymerases of the invention and 
preferably the mutant or modified polymerases of the invention are made by 
recombinant techniques. A number of cloned polymerase genes are available or 
may be obtained using standard recombinant techniques. 

To clone a gene encoding a polymerase, which may be modified in 
accordance with the invention, isolated DNA which contains the polymerase gene 
is used to construct a recombinant library in a vector. Any vector, well known in 
the art, can be used to clone the DNA polymerase of interest. However, the 
vector used must be compatible with the host in which the recombinant DNA 
library will be transformed. 

Prokaryotic vectors for constructing the plasmid library include plasmids 
such as those capable of replication in E. coli such as, for example, pBR322, 
ColEl, pSClOl, pUC-vectors (pUC18, pUC19, etc.: In: Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York (1982); and Sambrook el al y In: Molecular Cloning A Laboratory 
Manual (2d ed.) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1989)). Bacillus plasmids include pC194, pC221, pC217, etc. Such 
plasmids are disclosed by Glyczan, T. In: The Molecular Biology Bacilli, 
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Academic Press, York (1982), 307-329. Suitable Streptomyces plasmids include 
pIJlOl (Kendall et al, J. Bacteriol 169:4177-4183 (1987)). Pseudomonas 
plasmids are reviewed by John et al, (Rad Insec. Dis. 5:693-704 (1986)), and 
Igaki, (Jpn. J. Bacteriol 33:729-742 (1978)). Broad-host range plasmids or 
cosmids, such as pCP13 (Damns and Chakrabarbary, 1 Bacteriol 759:9-18, 
1984) can also be used for the present invention. The preferred vectors for 
cloning the genes of the present invention are prokaryotic vectors. Preferably, 
pCP13 and pUC vectors are used to clone the genes of the present invention. 

The preferred host for cloning the polymerase genes of interest is a 
prokaryotic host. The most preferred prokaryotic host is K coli. However, the 
desired polymerase genes of the present invention may be cloned in other 
prokaryotic hosts including, but not limited to, Escherichia, Bacillus, 
Streptomyces, Pseudomonas, Salmonella, Serratia, and Proteus. Bacterial hosts 
of particular interest include E. coli DH10B, which may be obtained from Life 
Technologies, Inc. (LTI) (Rockville, MD). 

Eukaryotic hosts for cloning and expression of the polymerases of interest 
include yeast, fungi, and mammalian cells. Expression of the desired polymerase 
in such eukaryotic cells may require the use of eukaryotic regulatory regions 
which include eukaryotic promoters. Cloning and expressing the polymerase gene 
in eukaryotic cells may be accomplished by well known techniques using well 
known eukaryotic vector systems. 

Once a DNA library has been constructed in a particular vector, an 
appropriate host is transformed by well known techniques. Transformed colonies 
are preferably plated at a density of approximately 200-300 colonies per petri dish. 
For thermostable polymerase selection, colonies are then screened for the 
expression of a heat stable DNA polymerase by transferring transformed E. coli 
colonies to nitrocellulose membranes. After the transferred cells are grown on 
nitrocellulose (approximately 1 2 hours), the cells are lysed by standard techniques, 
and the membranes are then treated at 95°C for 5 minutes to inactivate the 
endogenous E. coli enzyme. Other temperatures may be used to inactivate the 
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host polymerases depending on the host used and the temperature stability of the 
polymerase to be cloned. Stable polymerase activity is then detected by assaying 
for the presence of polymerase activity using well known techniques (see, e.g., 
Sagnere/a/., Gene 97:1 19-123 (1991), which is hereby incorporated by reference 
in its entirety). The gene encoding a polymerase of the present invention can be 
cloned using the procedure described by Sanger et al, supra. Other techniques 
for selecting cloned polymerases in accordance with the present invention will be 
well-known to those of ordinary skill in the art. 

Modifications or Mutations of Polymerases 

In accordance with the invention, the nucleotide binding domain of the 
polymerase of interest is modified or mutated in such a way as to produce a 
mutated or modified polymerase having reduced, substantially reduced or 
eliminated activity for adding non-templated 3' nucleotides. The O-helix region 
typically defines the nucleotide binding domain of DN A polymerases. The O-helix 
may be defined as RXXXKXXXFXXXYX (SEQ ID NO: 1 1), wherein X may be 
any amino acid. One or more mutations or combinations of mutations may be 
made in the O-helix of any polymerase in order to reduce or eliminate non- 
templated 3 1 nucleotide addition in accordance with the invention. Such mutations 
include point mutation, frame-shift mutations, deletions and insertions. Preferably, 
one or more point mutations, resulting in one or more amino acid substitutions, 
are used to produce polymerases having such activity. Such mutations may be 
made by a number of methods that will be familiar to one of ordinary skill, 
including but not limited to site-directed mutagenesis. In a preferred aspect of the 
invention, one or more mutations at positions R, K, F, and/or Y in the polymerase 
O-helix may be made to produced a polymerase having the desired activity. Most 
preferably, one or more mutations at position R and/or F and/or K and/or Y within 
the O-helix results in polymerases having reduced, substantially reduced or 
eliminated activity for adding non-templated 3' nucleotides. In the preferred 
aspect, amino acid substitutions are made at position R and/or F and/or K and/or 
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Y (or combinations thereof). Thus, R (Arg) and/or F (Phe) and/or K (Lys) may 
be substituted with any other amino acid including Ala, Arg, Asn, Asp, Cys, Gin, 
Glu, Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, and Val. 
Preferably, R (Arg) is substituted with amino acids Lys, Tyr, Leu, His, Gin, Met, 
or Asn. F (Phe) is preferably substituted with amino acids Tyr, Ala, Leu, Thr, and 
Ser. K (Lys) is preferably substituted with amino acids Arg, Tyr, Leu, His, Gin, 
Met or Asn, and more preferably with Arg or His. Y (Tyr) is preferably 
substituted with amino acids Lys, Arg, Ala, Thr, Phe, Leu, His, Gin, Met, or Asn. 
Positions corresponding to R, K, F and Y for RNA polymerases may also be 
determined by comparing nucleotide and/or amino acid sequences with those of 
DNA polymerases, to determine homologies therebetween. Corresponding 
mutations or modification may then be made to produce the desired result in any 
RNA polymerase. 

The O-helix has been identified and defined for a number of polymerases 
and may be readily identified for other polymerases by one with skill in the art. 
Thus, given the defined O-helix region and the methods and assays described 
herein, one with skill in the art can make one or a number of modifications which 
would result in polymerases having reduced, substantially reduced or eliminated 
activity for adding non-templated 3 1 nucleotides. Accordingly, the invention relates 
to methods for producing such polymerases having modifications in the O-helix 
domain resulting in reduction, substantial reduction or elimination of activity for 
adding non-templated 3' nucleotides, methods for producing nucleic acid 
molecules encoding such polymerases, and polymerases and nucleic acid 
molecules produced by such methods. 

The following table illustrates identified O-helix regions for known 
polymerases. 
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Polymerase 


O-Helix F*pinn 


SEO ID NO. 


Poll 


754 RRSAKAINFGLIYG 


12 


Taq 


659 RRAAKTINFGVLYG 


13 


T7 


5 1 8 RDN AKTFTYGFLYG 


14 


1 Tm 


722 RRVGKMVNFSnYG 


15 


T5 


588 RQAAKAITFGILYG 


16 


| Tma 


722 RRAGKMVNFSIIYG 


17 



Thus, in accordance with a preferred aspect of the invention, 
corresponding mutations in the R and/or F and/or K positions of the O-helix can 
be made for the following enzymes based on the tables below. 



Polymerase 


Mutation Position 


Poll 


Arg 7 * 


T5 


Arg 5 " 


T7 


Arg J » 


Taq 


Arg* 59 


Tne 


Arg™ 


Tma 


Arg™ 


Bca 


Arg 70S 


Bst 


Arg 702 


1 Tth 


Are 661 




Polymerase 


Mutation Position _| 


Poll 


Phe 762 


T5 


Phe 596 


T7 


Phe 528 


Taq 


Phe 667 


Tne 


Phe 730 


Tma 


Phe 730 


Bca 


Phe 713 


Bst 


Phe 710 


Tth 


Phe 669 
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Polymerase 


Mutation Position 


roll 




T5 


Lys 592 


T7 


Lys 522 


Tag 


Lys 662 


Tne 


Lys 726 


Tma 


Lys 726 


Bca 


Lys 707 


Bst 


Lys 706 


Tth 


Lys 665 



The mutation position of Arg 705 for Bca is based on the sequence 
information in GenBank. It should be noted, however, that according to the 
sequence described by Vemori etal 1 Biochem. (Japan) 113 AO 1 -41 0 ( 1 993), the 
position of Arg in Bca is 703. 

Additional Modifications or Mutations of Polymerases 

In accordance with the invention, in addition to the mutations or 
modifications described above, one or more additional mutations or modifications 
(or combinations thereof) may be made to the polymerases of interest. Mutations 
or modifications of particular interest include those modifications of mutations 
which (1) reduce or eliminate 3' to 5' exonuclease activity; and (2) reduce or 
eliminate 5* to 3' exonuclease activity. 

If the DNA polymerase has 3'-to-5' exonuclease activity, this activity may 
be reduced, substantially reduced, or eliminated by mutating the polymerase gene. 
Such mutations include point mutations, frame shift mutations, deletions and 
insertions. Preferably, the region of the gene encoding the 3'-to-5' exonuclease 
activity is mutated or deleted using techniques well known in the art (Sambrook 
et al. 9 (1989) in: Molecular Cloning, A Laboratory Manual (2nd Ed.) y Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY). 



WO 98/35060 



PCT/US98/02791 



-34- 



The 3'-to-5 ! exonuclease activity can be reduced or impaired by creating 
site specific mutants within the 3'-> 5' exonuclease domain. See infra. In a specific 
embodiment of the invention Asp 323 of Tne DNA polymerase is changed to any 
amino acid, preferably to Ala 323 to substantially reduce exonuclease activity. 
In another specific embodiment of the invention, Asp 323 of Tma may be changed 
to any other amino acid, preferably to Ala to substantially reduce 3'-+? 
exonuclease activity. The following represents a domain of interest for a number 
of polymerases for preparing 3^5' exonuclease mutants. 



Tne 318 PSFALDLETSS 

Pol I 350 PVFAFDTETDS 

T5 159 GPVAFDSETSA 

T7 1 MIVSDEEANA 



328 (SEQIDNO:18) 

360 (SEQIDNO:19) 

169 (SEQIDNO:20) 

10 (SEQIDNO:21) 



Mutations, such as insertions, deletions and substitutions within the above 
domain can result in substantially reduced 3'->5' exonuclease activity. By way of 
example, Asp 355 (Poll), Asp 164 (T5), and Asp 5 (T7) may be substituted with any 
amino acid to substantially reduce 3'-+5 ! exonuclease activity. For example, Asp 
at these positions may be substituted with Ala. 

The y->V exonuclease activity of the polymerases can be reduced, 
substantially reduced or eliminated by mutating the polymerase gene or by deleting 
the 5' to 3' exonuclease domain. Such mutations include point mutations, frame 
shift mutations, deletions, and insertions. Preferably, the region of the gene 
encoding the 5*-*y exonuclease activity is deleted using techniques well known 
in the art. In embodiments of this invention, any one of six conserved amino acids 
that are associated with the 5^3* exonuclease activity can be mutated. Examples 
of these conserved amino acids with respect to Tne DNA polymerase include 
Asp 8 , Glu 112 , Asp u \ Asp 115 , Asp 137 , and Asp 139 . Other possible sites for mutation 
areGly l02 ,Gly 187 andGly 195 . 
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Corresponding amino acid to target for other polymerases to reduce or 
eliminate 5'-*3' exonuclease activity as follows: 

E. coli poll: Asp 13 , Glu 113 , Asp 115 , Asp 116 , Asp 138 , and Asp 140 * 
Taq pol: Asp 18 , Glu 117 , Asp 119 , Asp 120 , Asp 142 , and Asp 144 
5 Tma pol: Asp 8 , Glu 112 , Asp 114 , Asp 115 , Asp 137 , and Asp 139 

Amino acid residues of Taq DNA polymerase are as numbered in U.S. 
5,079,3 52. Amino acid residues oiThermotoga maritima {Tma) DNA polymerase 
are numbered as in U.S. Patent No. 5,374,553. 

Examples of other amino acids which may be targeted for other 
10 polymerases to reduce S 1 -^ 1 exonuclease activity include: 



Enzyme or source 


Mutation positions 


Streptococcus pneumoniae 


Asp 10 , Glu" 4 , Asp 116 , Asp 117 , Asp 139 , Asp 141 


Thermus flaws 


Asp", Glu" 6 , Asp 118 , Asp 119 , Asp ,4 \ Asp 143 


Thermus thermophilus 


Asp 18 , Glu" 8 , Asp 120 . Asp 121 , Asp 143 , Asp ,4$ 


Deinococcus radiodurans 


Asp 18 . Glu" 7 , Asp 119 , Asp 120 , Asp 142 , Asp 144 


Bacillus caldotenax 


Asp 9 , Glu 109 , Asd 111 . Asp 112 , Asp 134 , Asp 136 



Coordinates of S. pneumoniae, T. flaws, D. radiodurans, B. caldotenax 
were obtained from Gutman and Minton. Coordinates of T thermophilus were 
obtained from International Patent No. WO 92/06200. 

20 Typically, the mutant polymerases of the invention can be affected by 

substitution of amino acids typically which have different properties. For example, 
an acidic amino acid such as Asp may be changed to a basic, neutral or polar but 
uncharged amino acid such as Lys, Arg, His (basic); Ala, Val, Leu, lie, Pro, Met, 
Phe, Trp (neutral); or Gly, Ser, Thr, Cys, Tyr, Asn or Gin (polar but uncharged). 

25 Glu may be changed to Asp, Ala, Val Leu, lie, Pro, Met, Phe, Trp, Gly, Ser, Thr, 

Cys, Tyr, Asn or Gin. 
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Preferably, oligonucleotide directed mutagenesis is used to create the 
mutant polymerases which allows for all possible classes of base pair changes at 
any determined site along the encoding DNA molecule. In general, this technique 
involves annealing a oligonucleotide complementary (except for one or more 
mismatches) to a single stranded nucleotide sequence coding for the DNA 
polymerase of interest The mismatched oligonucleotide is then extended by DNA 
polymerase, generating a double stranded DNA molecule which contains the 
desired change in the sequence on one strand. The changes in sequence can of 
course result in the deletion, substitution, or insertion of an amino acid. The 
double stranded polynucleotide can then be inserted into an appropriate expression 
vector, and a mutant polypeptide can thus be produced. The above-described 
oligonucleotide directed mutagenesis can of course be carried out via PCR. 

Enhancing Expression of Polymerases 

To optimize expression of the polymerases of the present invention, 
inducible or constitutive promoters are well known and may be used to express 
high levels of a polymerase structural gene in a recombinant host. Similarly, high 
copy number vectors, well known in the art, may be used to achieve high levels 
of expression. Vectors having an inducible high copy number may also be useful 
to enhance expression of the polymerases of the invention in a recombinant host. 

To express the desired structural gene in a prokaryotic cell (such as, 
K coli, B. subtilis, Pseudomonas, etc.), it is necessary to operably link the desired 
structural gene to a functional prokaryotic promoter. However, the natural 
promoter of the polymerase gene may function in prokaryotic hosts allowing 
expression of the polymerase gene. Thus, the natural promoter or other 
promoters may be used to express the polymerase gene. Such other promoters 
may be used to enhance expression and may either be constitutive or regulatable 
(i.e., inducible or derepressible) promoters. Examples of constitutive promoters 
include the int promoter of bacteriophage X, and the bla promoter of the 
^-lactamase gene of pBR322. Examples of inducible prokaryotic promoters 
include the major right and left promoters of bacteriophage X (P R and PJ, trp 7 
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reck, lacZ y lac\ tet y gal, trc, and tac promoters of E. coli. The B. subtilis 
promoters include a-amylase (Ulmanen etal, J. Bacteriol 162:176-182 (1985)) 
and Bacillus bacteriophage promoters (Gryczan, T., In: The Molecular Biology 
Of Bacilli, Academic Press, New York (1982)). Streptomyces promoters are 

5 described by Ward et a/., Mol Gen. Genet. 203:468478 (1986)). Prokaryotic 

promoters are also reviewed by Glick, J. Ind Microbiol. 1:277-282 (1987); 
Cenatiempto, Y. , Biochimie 68 :505-5 16(1 986); and Gottesman, Ann. Rev. Genet, 
j 8 A 1 5-442 (1 984). Expression in a prokaryotic cell also requires the presence of 
a ribosomal binding site upstream of the gene-encoding sequence. Such ribosomal 

10 binding sites are disclosed, for example, by Gold et al. 3 Ann. Rev. Microbiol. 

35:365404(1981). 

To enhance the expression of polymerases of the invention in a eukaryotic 
cell, well known eukaryotic promoters and hosts may be used. Preferably, 
however, enhanced expression of the polymerases is accomplished in a prokaryotic 

15 host. The preferred prokaryotic host for overexpressing the polymerases of the 

invention is E. coli. 



Isolation and Purification of Polymerases 

The enzyme(s) of the present invention is preferably produced by 
fermentation of the recombinant host containing and expressing the desired 

20 polymerase gene. However, the polymerases of the present invention may be 

isolated from any strain which produces the polymerase of the present invention. 
Fragments of the polymerase are also included in the present invention. Such 
fragments include proteolytic fragments and fragments having polymerase activity. 
Any nutrient that can be assimilated by a host containing the polymerase 

25 gene may be added to the culture medium. Optimal culture conditions should be 

selected case by case according to the strain used and the composition of the 
culture medium. Antibiotics may also be added to the growth media to insure 
maintenance of vector DNA containing the desired gene to be expressed. Media 
formulations have been described in DSM or ATCC Catalogs and Sambrook et 
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a/., In: Molecular Cloning, a Laboratory Manual (2nd ed.), Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY (1989). 

Host cells producing the polymerases of this invention can be separated 
from liquid culture, for example, by centrifugation. In general, the collected 
microbial cells are dispersed in a suitable buffer, and then broken down by 
ultrasonic treatment or by other well known procedures to allow extraction of the 
enzymes by the buffer solution. After removal of cell debris by ultracentrifugation 
or centrifugation, the polymerase can be purified by standard protein purification 
techniques such as extraction, precipitation, chromatography, affinity 
chromatography, electrophoresis or the like. Assays to detect the presence of the 
polymerase during purification are well known in the art and can be used during 
conventional biochemical purification methods to determine the presence of these 
enzymes. 

Thermotoga Polymerases 

Thermotdga polymerases for use in the present invention are obtained from 
any strain of Thermotoga species, more preferably from a strain of Thermotoga 
neapolitana (WO 96/10640 or WO96/41014) or Thermotoga maritima (U.S. 
Patent No. 5,374,553). Enzymes suitable for use in the present invention from 
these more preferred sources are the wildtype DNA polymerases (Tne from T 
neapolitana; Tma from 7! maritima\ or mutants or derivatives thereof 

The present invention provides isolated nucleic acid molecules encoding 
preferred mutant Tne DNA polymerases, mutant Tne DNA polymerases encoded 
by such isolated nucleic acid molecules, and specific mutant Tne DNA polymerase 
proteins. Most preferred are the wildtype Tne DNA polymerase (SEQ ID 
NOs:l,2), the wildtype Tma DNA polymerase (U.S. Patent No. 5,374,553), and 
the following mutants of Tne DNA polymerase: Tne N , A219, D323A (SEQ ID 
NO:3); Tne N'A283, D323A (SEQ ID NO:4); Tne NA192, D323A (SEQ ID 
NO:5); 7wD137A,D323A(SEQIDNO:6); 7«eD8A,D323A(SEQIDNO:7); 
Tne G195D, D323A (SEQ ID NO:8); Tne G37D, D323A (SEQ ID NO:9); 
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Tne N'A283 (SEQ ID NO:10); Tne D137A, D323A, R722K; Tne D137A, 
D323A, R722Y; Tne D137A, D323A, R722L; Tne D137A, D323A, R722H; 
Tne D137A, D323A, R722Q; 7weD137A, D323A, F730Y; Tne D137A, D323A, 
K726R; Tne D137A, D323A, K726H; Tne D137A, D323A, R722K, F730Y; 

5 Tne D137A, D323A, R722K, K726R; Tne D137A, D323A, R722K, K726H; 

Tne D137A, D323A, R722H, F730Y; Tne D137A, D323A, R722H, K726R 
Tne D137A, D323A, R722H, K726H; Tne D137A, D323A, R722Q, F730Y; 
Tne D137A, D323A, R722Q, K726R; Tne D137A, D323A, R722Q, K726H; 
Tne D137A, D323A, R722N, F730Y; Tne D137A, D323A, R722N, K726R 

10 Tne D137A, D323A, R722N, K726H; Tne D137A, D323A, F730S; Tne N*A283, 

D323A, R722K7H/Q/N/Y/L; Tneli'Ml9, D323A, R722K; 7«e>PA219 > D323A, 
F730Y; 7>ieN , A219, D323A.K726R; 7«eKA219, D323A, K726H; TneDWA, 
D323A, F730S, R722K7Y/Q/N/H/L, K726R/H; 7>ie D137A, D323A, F730T, 
R722K/Y/Q/N/H/L, K726R/H; 7we D137A, D323A, F730T; Tne F730S; J»e 

15 F730A; Tne K726R; 7he K726H; and Tne D137A, D323A, R722N. It will of 

course be understood by the skilled artisan that the designations of the above- 
described mutant polymerases indicate the position of the amino acid residue in 
the wildtype amino acid sequence (SEQ ID NO:2) that is being mutated, as well 
as to what residue the amino acid is being mutated. Thus, for example, "D137A" 

20 indicates that the Asp (D) residue at position 137 in SEQ ID NO:2 has been 

mutated to an Ala (A) residue, and, for example, H R722K7Y/Q/N/H/L" indicates 
that the Arg (R) residue at position 722 in SEQ ID NO:2 has been mutated to a 
Lys (K), Tyr (Y), Gin (Q), Asn (N), His (H) or Leu (L) residue. Mutant 
polymeraes having one or more mutations or modifications corresponding to the 

25 Tne mutants of the invnetion are also contemplated by the invention. 

The following chart indicates the nucleic acid sequences of the nucleic 
acid molecules encoding the above-described mutant Tne DN A polymerases (SEQ 
ID NOs:3-10), each with reference to the wildtype Tne DNA polymerase (SEQ 
IDNOrl): 
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SEQID 
NO: 


Deletion of SEQID NO: 
1 


Insertion 


Substitution to SEQ ID NO: 
1 


3 


Deletion of positions 1* 
657 from the 5'-end 


ATG AGC TTC 
atthe5 v -end 


A replaces Gat 
position 966; C replaces 
A at 968 and G replaces C 
at 969 


4 


Deletion of positions 1- 
849 from the 5'-end 


None 


A replaces Gat 966; C 
replaces A at 968 and G 
replaces C at 969 


5 


Deletion of positions 1- 
576 from the 5'-end 


ATGAATTCG 
AGC TCG GTA 
CCCatthe 5'- 
end 


A replaces G at 966; C 
replaces A at 968 and G 
replaces C at 969; A 
replaces G at 584 


6 


None 


None 


A replaces G at 966; C 
replaces A at 968 and G 
replaces C at 969; C 
replaces T at 408 and C 
replaces A at 410 


7 


None 


None 


A replaces G at 966; C 
replaces A at 968 and G 
replaces C at 969; C 
replaces A at 23 and C 
replaces T at 24 


8 


None 


None 


A replaces G at 966; C 
replaces A at 968 and G 
replaces C at 969; T 
replaces C at 576 and A 
replaces G at 584 


9 


None 


None ■ 


A replaces G at 966; C 
replaces A at 968 and G 
replaces C at 969; A 
replaces Gat 110 


10 


Deletion of positions 1- 
849 from the 5'-end 


None 


None 



Using these same approaches, the sequence guidance provided herein, and 
knowledge of appropriate nucleotide substitutions to be made to SEQ ED NO: 1 , 
one of ordinary skill can readily produce other nucleic acid molecules encoding 
mutant polymerases, such as those described in detail above, having the desired 
activity. In addition, other nucleic acid molecules which comprise a sequence 
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substantially different from those described above but which, due to the 
degeneracy of the genetic code, still encode a mutant Tne DNA polymerase having 
an amino acid sequence set forth above, are also encompassed by the present 
invention. Since the genetic code is well known in the art, it is routine for one of 
5 ordinary skill in the art to produce such mutants and degenerate variants without 

undue experimentation. 

Each of these mutant Tne DNA polymerases are reduced or substantially 
reduced in the ability to add a non-templated 3' terminal nucleotide to the growing 
strand. These mutant Tne DNA polymerase proteins may be prepared by 

10 recombinant DNA techniques routine to one of ordinary skill. Preferably, such 

mutant Tne polymerases are prepared by inserting an isolated DNA molecule 
having a nucleotide sequence as described above for each individual mutant into 
a recombinant vector, inserting the vector into a host cell, preferably an 
Escherichia coli cell, and culturing the host cell under conditions favoring the 

15 production of the mutant Tne DNA polymerase. The mutant Tne polymerase is 

then isolated from the host cell according to standard protein purification 
techniques. Further guidance for the preparation and isolation of mutant DNA 
polymerases from thermostable microorganisms can be found, for example, in U.S. 
Patent No. 5,374,553, in co-pending U.S. Patent Application No. 08/689,818 of 

20 Deb K. Chatterjee and A. John Hughes, entitled "Cloned DNA Polymerases from 

Thermotoga and Mutants Therof," filed September 6, 1996, and in co-pending 
U.S. Patent Application No. 08/689,807 of Deb K. Chatterjee, entitled "Cloned 
DNA Polymerases from Thermotoga and Mutants Therof," filed September 6, 
1996, the disclosures of all of which are incorporated herein in their entirety. 

25 In the methods of the present invention, Thermotoga DNA polymerases 

substantially reduced in 3 -5' exonuclease activity (such as a Tne mutant having an 
amino acid sequence as set forth in any one of SEQ ID NOs:3-9), or Thermotoga 
DNA polymerases not substantially reduced in 3-5' exonuclease activity (such as 
Tne DNA polymerase (SEQ ID NOs: 1 ,2), Tma DNA polymerase (U. S . Patent No. 

30 5,374,553), or the Tne mutant 7^6^283 (SEQ ID NO: 10)), may be used with 
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similar results, since both types oTThermotogaDNA polymerase are substantially 
reduced in the ability to add a nontemplated 3' terminal nucleotide to a DNA 
template. Other thermostable DNA polymerases substantially reduced in 3-5* 
exonuclease activity, such as Taq, VENT™(exo-), DEEPVENT™(exo-), 

5 Dtok(exo-) and THERMOLASE™ Tbr, are not preferred for use in the present 

methods as they will add non-templated nucleotides to the 3' termini of the 
amplification products as described below. However, such thermostable 
polymerase can be made which have reduced, substantially reduced or eliminated 
activity to add 3' non-template nucleotides by mutating or modifying the 

10 polymerase in accordance with the invention. The preferred Thermotoga 

polymerases of the invention contain such mutations or modifications in their 
O-helix. 

The recombinant host comprising the gene encoding Tne DNA 
polymerase, E. coli DH10B(pUC-Tne), was deposited on September 30, 1994, 

15 with the Collection, Agricultural Research Culture Collection (NRRL), 1815 

North University Street, Peoria, Illinois 61604 USA, as Deposit No. NRRL 
B-21238. The gene encoding Tma DNA polymerase has also been cloned and 
sequenced (U.S. Patent No. 5,374,553, which is expressly incorporated by 
reference herein in its entirety). Methods for preparing mutants and derivatives 

20 of these Tne and Tma polymerases are well-known in the art, and are specifically 

described in co-pending U.S. Patent Application No. 08/689,818 of Deb K. 
Chatterjee and A. John Hughes, entitled "Cloned DNA Polymerases from 
Thermotoga and Mutants Therof," filed September 6, 1996, and co-pending U.S. 
Patent Application No. 08/689,807 of Deb K. Chatterjee, entitled "Cloned DNA 

25 Polymerases from Thermotoga and Mutants Therof, " filed September 6, 1 996, the 

disclosures of which are incorporated herein in their entirety. 

Advantages of Thermostable Polymerases 

The use of thermostable polymerases (e.g. Thermotoga polymerases) or 
mutants or derivatives thereof in the methods of the present invention provide 
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several distinct advantages. These advantages are particularly apparent in the 
application of the present methods to analysis and typing of minisatellite, 
microsatellite and STR DNA regions. 

With respect to traditional thermolabile DNA polymerases used in DNA 
amplification and sequencing, such as T4, T7 or K coli Klenow fragment 
polymerases, thermostable polymerases such as Thermotoga DNA polymerases 
maintain their enzymatic activity in the multiple high-temperature cycles used in 
PCR and analogous automated amplification methodologies. It is therefore 
unnecessary to add fresh enzyme at the beginning of each amplification cycle when 
using thermostable polymerases, as must be done when thermolabile enzymes are 
used. 

With respect to other thermostable enzymes, it has been unexpectedly 
discovered in the present invention (as described in more detail in the Examples 
below) that the use of Tne or Tma DNA polymerase mutants or derivatives 
thereof, does not result in the incorporation of non-templated 3' nucleotides into 
the newly synthesized DNA strands during DNA amplification reactions. This 
non-templated incorporation is a common problem when using certain other 
commonly employed thermostable enzymes, such as Taq, VENT™(exo-), 
DEEPVENT™(exo-), Dtok(exo-) and THERMOLASE™ Tbr. It has also been 
unexpectedly discovered that mutants of these polymerases can be made to reduce 
or eliminate addition of non-templated 3' nucleotides. In particular, such 
mutations are preferably made within the O-helix of such polymerases. 

Thus, the use of Tne or Tma DNA polymerases or mutants or derivatives 
thereof (or other mutant polymerases produced according to the invention) in 
amplifying and typing DNA sequences, particularly hypervariable DNA sequences 
such as minisatellite, microsatellite or STR regions, will allow a faithful 
amplification and resolution of polymorphisms in these regions. This faithful 
resolution is not possible using other thermostable polymerases due to their 
propensity for non-templated incorporation. Thus, these enzymes are suitable for 
use in automated amplification systems such as PCR 
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Sources of DNA 

Suitable sources of DNA, including a variety of cells, tissues, organs or 
organisms, may be obtained through any number of commercial sources (including 
American Type Culture Collection (ATCC), Rockville, Maryland; Jackson 
Laboratories, Bar Harbor, Maine; Cell Systems, Inc., Kirkland, Washington; 
Advanced Tissue Sciences, La Jolla, California). Cells that may be used as 
starting materials for genomic DNA preparation are preferably eukaryotic 
(including fungi or yeasts, plants, protozoans and other parasites, and animals 
including humans and other mammals). Although any mammalian cell may be 
used for preparation of DNA, preferred are blood cells (erythrocytes and 
leukocytes), endothelial cells, epithelial cells, neuronal cells (from the central or 
peripheral nervous systems), muscle cells (including myocytes and myoblasts from 
skeletal, smooth or cardiac muscle), connective tissue cells (including fibroblasts, 
adipocytes, chondrocytes, chondroblasts, osteocytes and osteoblasts) and other 
stromal cells (e.g., macrophages, dendritic cells, Schwann cells), although other 
cells, including the progenitors, precursors and stem cells that give rise to the 
above-described somatic cells, are equally suitable. Also suitable for use in the 
preparation of DNA are mammalian tissues or organs such as those derived from 
brain, kidney, liver, pancreas, blood, bone marrow, muscle, nervous, skin, 
genitourinary, circulatory, lymphoid, gastrointestinal and connective tissue 
sources, as well as those derived from a mammalian (including human) embryo or 
fetus. These cells, tissues and organs may be normal, or they may be pathological 
such as those involved in infectious diseases (caused by bacteria, fungi or yeast, 
viruses (including ADDS) or parasites), in genetic or biochemical pathologies (e.g. , 
cystic fibrosis, hemophilia, Alzheimer's disease, schizophrenia, muscular dystrophy 
or multiple sclerosis), or in cancerous processes. 

More specifically, in one aspect of the invention, the relationship between 
a first individual and a second individual may be determined by analyzing and 
typing a particular polymorphic DNA fragment, such as a minisatellite or 
microsatellite DNA sequence. In such a method, the amplified fragments for each 
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individual are compared to determine similarities or dissimilarities. Such an 
analysis is accomplished, for example, by comparing the size of the amplified 
fragments from each individual, or by comparing the sequence of the amplified 
fragments from each individual. In another aspect of the invention, genetic 

5 identity can be determined. Such identity testing is important, for example, in 

paternity testing, forensic analysis, etc. In this aspect of the invention, a sample 
containing DNA (e.g., a crime scene sample or a sample from an individual) is 
analyzed and compared to a sample from one or more individuals. In one such 
aspect of the invention, one sample of DNA may be derived from a first individual 

10 and another sample may be derived from a second individual whose relationship 

to the first individual is unknown; comparison of these samples from the first and 
second individuals by the methods of the invention may then facilitate a 
determination of the genetic identity or relationship between the first and second 
individual. In a particularly preferred such aspect, the first DNA sample may be 

15 a known sample derived from a known individual and the second DNA sample 

may be an unknown sample derived, for example, from crime scene material. In 
an additional aspect of the invention, one sample of DNA may be derived from a 
first individual and another sample may be derived from a second individual who 
is related to the first individual; comparison of these samples from the first and 

20 second individuals by the methods of the invention may then facilitate a 

determination of the genetic kinship of the first and second individuals by allowing 
examination of the Mendelian inheritance, for example, of a polymorphic, 
minisatellite, microsatellite or STR DNA fragment. In another aspect of the 
invention, DNA fragments important as genetic markers for encoding a gene of 

25 interest can be identified and isolated. For example, by comparing samples from 

different sources, DNA fragments which may be important in causing diseases 
such as infectious diseases (of bacterial, fungal, parasitic or viral etiology), cancers 
or genetic diseases, can be identified and characterized. In this aspect of the 
invention a DNA sample from normal cells or tissue is compared to a DNA sample 

30 from diseased cells or tissue. Upon comparison according to the invention, one 
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or more unique polymorphic fragments present in one DNA sample and not 
present in the other DNA sample can be identified and isolated. Identification of 
such unique polymorphic fragments allows for identification of sequences 
associated with, or involved in, causing the diseased state. 

Once the starting cells, tissues, organs or other samples are obtained, DNA 
may be prepared therefrom by methods that are well-known in the art (See, e.g., 
Maniatis, T., et aL, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor, New York: Cold Spring Harbor Laboratory Press, pp. 9.16-9.23 (1989); 
Kaufman, P.B., et aL, Handbook of Molecular and Cellular Methods in Biology 
and Medicine, Boca Raton, Florida: CRC Press, pp. 1-26 (1995)). The DNA 
samples thus prepared may then be used to identify, analyze and type polymorphic 
DNA fragments, including minisatellite, microsatellite and STR DNA fragments, 
by amplification, preferably by PCR amplification, as modified by the methods of 
the present invention. 

General methods for amplification and analysis of DNA fragments are 
well-known to one of ordinary skill in the art (see, e.g., U.S. Pat. Nos. 4,683, 195; 
4,683,202; and 4,800,159; Innis, M.A., et al., eds., PCR Protocols: A Guide to 
Methods and Applications, San Diego, California: Academic Press, Inc. (1990); 
Griffin, H.G., and Griffin, A.M., eds., PCR Technology: Current Innovations, 
Boca Raton, Florida: CRC Press (1994)). Typically, these methods comprise 
contacting the DNA sample with a thermostable DNA polymerase in the presence 
of one or more primer sequences, amplifying the DNA sample to generate a 
collection of amplified polymorphic, minisatellite, microsatellite or STR DNA 
fragments, preferably by PCR or equivalent automated amplification technique, 
separating the amplified DNA fragments by size, preferably by gel electrophoresis, 
and analyzing the gels for the presence of polymorphic, minisatellite, microsatellite 
or STR DNA fragments by direct comparison of the pattern of fragments 
generated from a first sample of DNA to those from a second sample of DNA, or 
by a more indirect comparison using known size markers. 
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As noted above, amplification protocols used heretofore for analyzing and 
typing polymorphic DNA fragments, particularly minisatellite, microsateliite or 
STR DNA sequences, use certain thermostable DNA polymerases such as Taq 
(U.S. Patent Nos. 4,683,195; 4,683,202; and 4,800,159). However, as discussed 
in detail above, these approaches yield amplification products in which one or 
more non-templated nucleotides is added to the 3' termini of the products by the 
polymerases, thus leading to heterogeneity in the amplification products, and 
ambiguity concerning the correct size of the amplification products. 

This problem is overcome in the present invention by contacting the DNA 
sample in the amplification reaction mixtures with one or more DNA polymerases 
of the invention which are reduced, substantially reduced or eliminated in the 
ability to add a nontemplated 3 1 terminal nucleotide to the growing strand. 
Preferably, such DNA polymerases are Thermotoga DNA polymerases, more 
preferably a Thermotoga DNA polymerase substantially reduced in 3'-5* 
exonuclease activity, still more preferably a Tne polymerase (SEQ ID NOs:l,2), 
a Tma polymerase (U.S. Patent No. 5,374,553), or a mutant or derivative thereof, 
and most preferably one of the following mutants of Tne polymerase: 
Tne NA219, D323A (SEQ ID NO:3); Tne NA283, D323A (SEQ ID NO:4); 
Tne NA192, D323A (SEQ ID NO:5); Tne D137A, D323A (SEQ ID NO:6); 
Tne D8A, D323A (SEQ ID NO:7); Tne G195D, D323A (SEQ ID NO:8); Tne 
G37D, D323A (SEQ ID NO:9); Tne N»A283 (SEQ ID NO:10); Tne D137A, 
D323A, R722K; Tne D137A, D323A, R722Y; Tne D137A, D323A, R722L; 
7weD137A,D323A,R722H; 7heD137A,D323A,R722Q; 7heD137A,D323A, 
F730Y; 7>«?D137A,D323A,K726R; 7w*D137A,D323A,K726H; 7heD137A, 
D323A, R722K, F730Y; Tne D137A, D323A, R722K, K726R; Tne D137A, 
D323A, R722K, K726H; Tne D137A, D323A, R722H, F730Y; Tne D137A, 
D323A, R722H, K726R; Tne D137A, D323A, R722H, K726H; Tne D137A, 
D323A, R722Q, F730Y; Tne D137A, D323A, R722Q, K726R; Tne D137A, 
D323A, R722Q, K726H; Tne D137A, D323A, R722N, F730Y; Tne D137A, 
D323A, R722N, K726R; Tne D137A, D323A, R722N, K726H; Tne D137A, 
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D323A,F730S; JweN ! A283, D323A, R722K/H/Q/N/Y/L; TneWA2\9, D323A, 
R722K; TneKA2\9, D323A,F730Y; 7heN'A219,D323A,K726R; 7weN'A219, 
D323A,K726H; rwD137A,D323A,F730S,R722K^T/Q/N/H^,K726Mi; Tne 
D137A, D323A, F730T, R722K/Y/Q/N/H/L, K726R/H; Tne D137A, D323A, 
F730T; Tne F730S; 7>ie F730A; Tne K726R; 7m? K726H; and Tne D137A, 
D323A, R722N. 

It will be understood, however, that other thermostable DN A polymerases 
or mutants thereof any of which are reduced, substantially reduced, or eliminated 
in the ability to add a non-templated 3' terminal nucleotide to the growing strand, 
may be used in the methods of the present invention equivalently. The DNA 
polymerases are used in the methods of the present invention at a concentration 
of about 0.0001 units/ml to about 10 units/ml, preferably at a concentration of 
about 0.001 units/ml to about 5 units/ml, more preferably at a concentration of 
about 0.004 units/ml to about 1 unit/ml, and most preferably at a concentration of 
about 0.04 units/ml. Thus, the methods of the present invention produce a 
population of amplified DNA fragments, most preferably of polymorphic or 
microsatelliteDNA fragments, which comprise substantially no non-templated 3' 
terminal nucleotides. By "substantially no non-templated 3' terminal nucleotides" 
is meant that the population of amplified DNA fragments demonstrates about 0- 
50%, about 0-30%, about 0-20%, preferably about 0-10%, more preferably about 
0-5%, still more preferably about 0-1% and most preferably about 0%, of DNA 
molecules containing non-templated 3 ! nucleotides compared to amplified DNA 
fragments produced by the polymerase control. When testing the ability of a DNA 
polymerase to add 3' non-templated nucleotides, the polymerase, when it has 
substantially reduced or eliminated 3" exonuclease activity, is compared to Tag 
DNA polymerase (see above). When testing polymerases which have been 
modified or mutated to reduce or eliminate 3' non-templated nucleotide addition, 
the mutated or modified polymerase is compared to the corresponding wildtype, 
unmodified or unmutated polymerase (see above). 
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Following amplification by the methods of the present invention, the 
amplified DNA fragments may be analyzed to identify or type a polymorphic, 
minisatellite, microsatellite or STR DNA fragment. This step is usually 
accomplished by separation of the amplified DNA fragments by size, a procedure 
which permits the determination of the presence of unique polymorphic fragments 
in one or more of the DNA samples. The fragments may be separated by any 
physical or biochemical means including gel electrophoresis, capillary 
electrophoresis, chromatography (including sizing, affinity and 
immunochromatography), density gradient centrifugation and immunoadsorption. 
For carrying out the present invention, separation of DNA fragments by gel 
electrophoresis is particularly preferred, as it provides a rapid and highly 
reproducible means of sensitive separation of a multitude of DNA fragments, and 
permits direct, simultaneous comparison of the fragments in several samples of 
DNA, or samples of DNA from a first and a second individual. 

Gel electrophoresis is typically performed on agarose or polyacrylamide 
sequencing gels according to standard protocols, preferably using gels containing 
polyacrylamide at concentrations of 3-12% and most preferably at about 8%, and 
containing urea at a concentration of about 4-12M, most preferably about 8M. 
Samples are loaded onto the gels, usually with samples containing amplified DNA 
fragments prepared from different sources of genomic DNA being loaded into 
adjacent lanes of the gel to facilitate subsequent comparison. Reference markers 
of known sizes may be used to facilitate the comparison of samples. Following 
electrophoretic separation, DNA fragments may be visualized and identified by a 
variety of techniques that are routine to those of ordinary skill in the art, such as 
autoradiography. One can then examine the autoradiographic films either for 
differences in polymorphic fragment patterns ("typing") or for the presence of one 
or more unique bands in one lane of the gel ("identifying"); the presence of a band 
in one lane (corresponding to a single sample, cell or tissue type) that is not 
observed in other lanes indicates that the DNA fragment comprising that unique 
band is source-specific and thus a potential polymorphic DNA fragment. 
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A variety of DNA fragments comprising polymorphic, minisatellite, 
microsatellite or STR DNA fragments can thus be identified using the methods of 
the present invention by comparing the pattern of bands on the films depicting 
various samples. Importantly, using the present methods the amplification 
5 products of the polymorphic DNA fragments will be faithful copies of the template 

(allele) material — Le., they will not exhibit undesired additional nucleotides at 
their 3' termini via non-templated addition of nucleotides by the polymerases. One 
can extend this approach, in another preferred embodiment, to isolate and 
characterize these fragments or any DNA fragment amplified without the non- 
10 templated addition of a 3' terminal nucleotide. In this embodiment, one or more 
of the unique DNA fragments are removed from the gel which was used for 
identification (see above), according to standard techniques such as electrocution 
or physical excision. 

The isolated unique DNA fragments may then be inserted into standard 
15 nucleotide vectors, including expression vectors, suitable for transfection or 

transformation of a variety of prokaryotic (bacterial) or eukaryotic (yeast, plant 
or animal including human and other mammalian) cells. In particular, the present 
invention provides methods of cloning such isolated unique DNA fragments, or 
any PCR-amplified DNA fragment, by blunt-end cloning. As described above, 
20 Taq DNA polymerase adds a non-templated nucleotide, typically a 

deoxyadenosine ("A"), to the 3' terminus of the amplified DNA fragment. Thus, 
Tbfl-catalyzed PCR generates a collection of DNA fragments with 3' A overhangs. 
To clone such 7a#-amplified fragments, two approaches are commonly used: 
either the 3' A overhang is removed by treating the amplified fragment with, for 
25 example, T4 DNA polymerase (a technique sometimes called "3' polishing"), or 

a special cloning vector with a 3* T overhang (a "TA cloning vector") is used. Of 
course, such approaches are more time-consuming and expensive than if direct 
insertion of the amplified fragment were done. Such a direct approach is possible 
using the methods of the invention, which generates little or no 3' A overhangs 
30 (and thus, blunt ends) on the amplified DNA fragments. The DNA fragments, 
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amplified according to the methods of the invention, may thus be directly inserted 
into corresponding blunt-ended vectors according to standard techniques (for 
example, using T4 DNA ligase). Thus, the present invention provides a method 
of blunt-end cloning of a DNA fragment that obviates the use of TA cloning 
vectors or 3' polishing. 

To identify the presence of minisatellite DNA fragments, the polymorphic 
DNA fragments that are identified and isolated by the methods of the present 
invention may be further characterized, for example by sequencing {i.e., 
determining the nucleotide sequence of the polymorphic fragments), by methods 
described above and others that are standard in the art (see, e.g. , U. S . Patent Nos. 
4,962,022 and 5,498,523, which are directed to methods of DNA sequencing). 

Kits 

The invention also provides kits for use in the identification, analysis and 
typing of a polymorphic DNA fragment, particularly a minisatellite or STR DNA 
fragment, according to the present methods. Kits according to the present 
invention may comprise a carrying means being compartmentalized to receive in 
close confinement therein one or more containers such as vials, tubes, bottles and 
the like. Each of such containers may comprise components or a mixture of 
components needed to perform DNA amplification or analysis. 

Such kits may comprise of one or more thermostable DNA polymerases 
reduced, substantially reduced or eliminated in the ability to add a non-templated 
3* nucleotide to a growing DNA strand. Preferably the container contains a 
Thermotoga DNA polymerase or a mutant or a derivative thereof, particularly 
those described in full detail above. The kit may also contain one or more DNA 
primer molecules, one or more deoxyribonucleoside triphosphates needed to 
synthesize a DNA molecule complementary to a DNA template, and/or a buffer 
suitable for amplification of a nucleic acid molecule (or combinations threof). 

A kit for DNA analysis may include one or more of the above components, 
and may further include containers which contain reagents necessary for 
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separation and analysis of DNA fragments, such as polyacrylamide, agarose, urea, 
detergents and the like. 

Of course, it is also possible to combine one or more of these reagents in 
a single tube. A detailed description of such formulations at working 
concentrations is described in co-pending U.S. Application No. 08/689,815 of 
Ayoub Rashtchian and Joseph Solus, entitled "Stable Compositions for Nucleic 
Acid Amplification and Sequencing" filed on August 14, 1996, the disclosure of 
which is incorporated by reference herein in its entirety. 

The invention also relates to kits for detectably labeling molecules, 
sequencing, amplifying and synthesizing molecules by well known techniques. See 
U.S. Patent Nos. 4,962,020, 5,173,411, 4,795,699, 5,498,523, 5,405,776 and 
5,244,797. Such kits may comprise a carrying means being compartmentalized to 
receive in close confinement one or more container means such as vials, test tubes 
and the like. Each of such container means comprises components or a mixture 
of components needed to perform nucleic acid synthesis, sequencing, labeling, or 
amplification. 

A kit for sequencing DNA may comprise a number of container means. 
Such a kit may comprise one or more of the polymerases of the invention, one or 
a number of types of nucleotides needed to synthesize a DNA molecule 
complementary to DNA template, one or a number of different types of 
terminators (such as dideoxynucleoside triphosphates), a pyrophosphatase, one or 
a number of primers and/or a suitable sequencing buffer (or combinations of such 
components). 

A kit used for amplifying or synthesizing of nucleic acids will comprise, 
one or more polymerases of the invention, and one or a number of nucleotides or 
mixtures of nucleotides. Various primers may be included in a kit as well as a 
suitable amplification or synthesis buffers. 

When desired, the kit of the present invention may also include container 
means which comprise detectably labeled nucleotides which may be used during 
the synthesis or sequencing of a nucleic acid molecule. One of a number of labels 
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may be used to detect such nucleotides. Illustrative labels include, but are not 
limited to, radioactive isotopes, fluorescent labels, chemiluminescent labels, 
bioluminescent labels and enzyme labels. 

Use of the Methods and Kits 

The polymeraes, methods and kits embodied in the present invention will 
have general utility in any application utilizing nucleic acid amplification 
methodologies, particularly those directed to the analysis and typing of 
polymorphic or minisatellite DNA fragments, and most particularly those directed 
to the analysis and typing of minisatellite, microsatellite and STR DNA fragments. 
Amplification techniques in which the present methods may be used include PCR 
(U.S. Patent Nos. 4,683,195 and 4,683,202), Strand Displacement Amplification 
(SDA; U.S. Patent No. 5,455,166; EP 0 684 315), and Nucleic Acid Sequence- 
Based Amplification (NASBA; U.S. Patent No. 5,409,818; EP 0 329 822). 
Nucleic acid analysis and typing techniques which may employ the present 
compositions include nucleic acid sequencing methods such as those disclosed in 
U.S. Patent Nos. 4,962,022 and 5,498,523, as well as more complex PCR-based 
nucleic acid fingerprinting techniques such as Random Amplified Polymorphic 
DNA (RAPD) analysis (Williams, J.G.K., et aL 9 NucL Acids Res. J8(22):653\- 
6535, 1990), Arbitrarily Primed PCR (AP-PCR; Welsh, J., and McClelland, M., 
NucL Acids Res. 75(2^:7213-7218, 1990), DNA Amplification Fingerprinting 
(DAF; Caetano-Anolles et al. y Bio/Technology 9:553-557, 1991), and 
microsatellite PCR or Directed Amplification of Minisatellite-region DNA 
(DAMD; Heath, D.D., et aL, NucL Acids Res. 21(24): 5782-5785, 1993). In 
particular, the polymerases, methods and kits of the present invention will be 
useful in the fields of medical genetics, therapeutics and diagnostics, forensics 
(particularly identity and paternity testing), and agricultural {e.g., plant breeding) 
and other biological sciences, in any procedure utilizing DNA polymerases for 
analysis and typing of polymorphic, minisatellite, microsatellite or STR DNA 
fragments. Particularly suitable for diagnosis by the methods of the present 
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invention are genetic diseases such as cystic fibrosis, hemophilia, Alzheimer's 
disease, schizophrenia, muscular dystrophy or multiple sclerosis. Together, these 
abilities will assist medical professionals and patients in diagnostic and prognostic 
determinations as well as in the development of treatment and prevention regimens 
for these and other disorders. 

It will also be apparent to one of ordinary skill in the art that the present 
methods may be used to screen animal tissues to be subsequently used in medical 
procedures such as tissue or organ transplants, blood transfusions, zygote 
implantations and artificial inseminations. In such procedures, pre-screening of the 
subject tissues for the presence of particular polymorphic DNA fragments may 
improve the success of tissue or organ transplants (by decreasing the likelihood 
of rejection due to donor-recipient genetic incompatibility) and of zygote 
implantations (by eliminating the use of genetically defective zygotes). Similarly, 
use of these methods will reduce the chances of transmission of infectious diseases 
(e.g., hepatitis and AIDS) in medical procedures that are often prone to such 
transmission, such as blood transfusions and artificial insemination. Finally, use 
of the present invention for identification of unique polymorphic, minisatellite, 
microsatelliet and STR DNA fragments will assist in forensic science in such 
applications as crime-scene analysis of blood, tissue and body secretions 
containing small amounts of DNA, as well as in paternity testing. 

It will be readily apparent to those skilled in the relevant arts that other 
suitable modifications and adaptations to the methods and applications described 
herein are obvious and may be made without departing from the scope of the 
invention or any embodiment thereof Having now described the present invention 
in detail, the same will be more clearly understood by reference to the following 
examples, which are included herewith for purposes of illustration only and are not 
intended to be limiting of the invention. 
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Example 1: Bacterial Strains And Growth Conditions 

Thermotoga neapolitana DSM No. 5068 was grown under anaerobic 
conditions as described in the DSM catalog (addition of resazurin, Na^, and 
sulfur granules while sparging the media with nitrogen) at 85°C in an oil bath from 

5 1 2 to 24 hours. The cells were harvested by filtering the broth through Whatman 

#1 filter paper. The supernatant was collected in an ice bath and then centrifiiged 
in a refrigerated centrifuge at 8,000 rpms for twenty minutes. The cell paste was 
stored at -70°C prior to total genomic DNA isolation. 

E. coli strains were grown in 2X LB broth base (Lennox L broth base: 

10 GIBCO/BRL) medium. Transformed cells were incubated in SOC (2% tryptone, 

0.5% yeast extract, yeast 10 mM NaCl, 2.5 mM KC1, 20mM glucose, lOmM 
MgCl 2 , and lOmM MgS0 4 per liter) before plating. When appropriate antibiotic 
supplements were 20 mg/1 tetracycline and 100 mg/1 ampicillin. E. coli strain 
DH10B (Lorow et al. y Focus 12:19-20 (1990)) was used as host strain. 

15 Competent DH10B may be obtained from Life Technologies, Inc. (LTI) 

(Rockville, MD). 

Example 2: DNA Isolation 

Thermotoga neapolitana chromosomal DNA was isolated from 1. lg of 
cells by suspending the cells in 2.5 ml TNE (50mM Tris-HCl, pH 8.0, 50mM 

20 NaCl, lOmM EDTA) and treated with 1% SDS for 10 minutes at 37°C. DNA 

was extracted with phenol by gently rocking the lysed cells overnight at 4°C. The 
next day, the lysed cells were extracted with chloroform:isoamyl alcohol. The 
resulting chromosomal DNA was further purified by centrifiigation in a CsCl 
density gradient. Chromosomal DNA isolated from the density gradient was 

25 extracted three times with isopropanol and dialyzed overnight against a buffer 

containing 10 mM Tris-HCl (pH 8.0) and 1 mM EDTA (TE). 
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Example 3: Construction of Genomic Libraries 

The chromosomal DNA isolated in Example 2 was used to construct a 
genomic library in the plasmid pCP13. Briefly, 10 tubes each containing lOpg of 
Thermotoga neapolitana chromosomal DNA was digested with 0.01 to 10 units 
of &W/IIIA1 for 1 hour at 37°C. A portion of the digested DNA was tested in an 
agarose (1 .2%) gel to determine the extent of digestion. Samples with less than 
50% digestion were pooled, ethanol precipitated and dissolved in TE. 6.5 \xg of 
partially digested chromosomal DNA was ligated into 1 .5 jig of pCP13 cosmid 
which had been digested with BamYR restriction endonuclease and 
dephosphorylated with calf intestinal alkaline phosphatase. Ligation of the 
partially digested Thermotoga DNA and BamHl cleaved pCP 1 3 was carried out 
with T4 DNA ligase at 22°C for 16 hours. After ligation, about ljig of ligated 
DNA was packaged using ^-packaging extract (obtained from Life Technologies, 
Inc., Rockville, MD). DH10B cells (Life Tech. Inc.) were then infected with 100 
Hi of the packaged material. The infected cells were plated on tetracycline 
containing plates. Serial dilutions were made so that approximately 200 to 300 
tetracycline resistant colonies were obtained per plate. 

Example*: Screening/or Clones Expressing Thermotoga neapolitana DNA 
Polymerase 

Identification of the Thermotoga neapolitanaDNA polymerase gene of the 
invention was cloned using the method of Sagner et al t Gene 97: 1 1 9- 1 23 ( 1 99 1 ) 
which reference is herein incorporated in its entirety. Briefly, the £ coli 
tetracycline resistant colonies from Example 3 were transferred to nitrocellulose 
membranes and allowed to grow for 12 hours. The cells were then lysed with the 
fumes of chloroform:toluene (1:1) for 20 minutes and dried for 10 minutes at 
room temperature. The membranes were then treated at 95°C for 5 minutes to 
inactivate the endogenous E. coli enzymes. Surviving DNA polymerase activity 
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was detected by submerging the membranes in 15 ml of polymerase reaction mix 
(50 mM Tris-HCl (pH 8.8), 1 mM MgCl 2 , 3 mM p-mercaptoethanol, 10 \iM 
dCTP, dGTP, dTTP, and 15 \iCi of 3,000 Ci/mmol [ct 32 P]dATP) for 30 minutes 
at65°C 

Using autoradiography, three colonies were identified that expressed a 
Thermotoga neapolitana DNA polymerase. The cells were grown in liquid 
culture and the protein extract was made by sonication. The presence of the 
cloned thermostable polymerase was confirmed by treatment at 90°C followed by 
measurement of DNA polymerase activity at 72°C by incorporation of radioactive 
deoxyribonucleoside triphosphates into acid insoluble DNA. One of the clones, 
expressing Tne DNA polymerase, contained a plasmid designated pCP13-32 and 
was used for further study. 

Example 5: Subcloning of Tne DNA polymerase 

Since the pCP13-32 clone expressing the Tne DNA polymerase gene 
contains about 25 kb of T. neapolitanaVNA, subcloning a smaller fragment of the 
Tne polymerase gene was attempted. The molecular weight of the Tne DNA 
polymerase purified from E. co///pCP13-32 was about 100 kd. Therefore, a 
2.5-3.0 kb DNA fragment will be sufficient to code for full-length polymerase. A 
second round of Sat/3 A partial digestion similar to Example 3 was done using 
pCP13-32 DNA. In this case, a 3.5 kb region was cut out from the agarose gel, 
purified by Gene Clean (BIO 101,LaJolla, CA) and ligated into plasmid pSport 1 
(Life Technologies, Inc.) which had been linearized with BamHl and 
dephosphorylated with calf intestinal alkaline phosphatase. After ligation, DH1 0B 
was transformed and colonies were tested for DNA polymerase activity as 
described in Example 1 . Several clones were identified that expressed Tne DNA 
polymerase. One of the clones (pSport-7he) containing about 3 kb insert was 
further characterized. A restriction map of the DNA fragment is shown in Fig. 1 . 
Further, a 2.7 Kb HindUl-Ssil fragment was subcloned into pUC19 to generate 
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pUC19-r«e. E. co///pUC19-7w also produced Tne DNA polymerase. £. co// 
DH10B (pUC19-7we) was deposited on September 30, 1994 with the Collection, 
Agricultural Research Culture Collection (NRRL), 1815 Peoria, IL 61604 as 
Deposit No. NRRL B-21338. The nucleotide and amino acid sequence of Tne 
polymerase is described in U.S. application serial nos. 08/706,702 and 08/706,706 
filed September 9, 1996, both of which are incorporated by reference herein. 



Example 6: Purification ofThermotoga neapolitana DNA Polymerase from 
K coli 

Twelve grams of E. coli cells expressing cloned Tne DNA polymerase 

10 (DH1 0B/pSport-7we) were lysed by sonication (four thirty-second bursts with a 

medium tip at the setting of nine with a Heat Systems Ultrasonics Inc., model 375 
sonicator) in 20 ml of ice cold extraction buffer (50 mM Tris HC1 (pH 7.4), 8% 
glycerol, 5 mM mercaptoethanol, 10 mM NaCl, 1 mM EDTA, 0.5 mM PMSF). 
The sonicated extract was heated at 80°C for 15 min. and then cooled in ice for 

15 5 min. 50 mM KC1 and PEI (0.4%) was added to remove nucleic acids. The 

extract was centrifuged for clarification. Ammonium sulfate was added to 60%, 
the pellet was collected by centrifugation and resuspended in 10 ml of column 
buffer (25 mM Tris-HCl (pH 7.4), 8% glycerol, 0.5% EDTA, 5mM 
2-mercaptoethanol, 10 mM KC1). A Blue r Sepharose (Pharmacia) column, or 

20 preferably a Toso heparin (Tosohaas) column, was washed with 7 column volumes 

of column buffer and eluted with a 15 column volume gradient of buffer from 
10mMto2MKCl. Fractions containing polymerase activity were pooled. The 
fractions were dialyzed against 20 volumes of column buffer. The pooled 
fractions were applied to a Toso650Q column (Tosohaas). The column was 

25 washed to baseline OD 280 and elution effected with a linear 10 column volume 

gradient of 25 mM Tris (pH 7.4), 8% glycerol, 0.5 mM EDTA, 10 mM KC1, 5 
mM P-mercaptoethanol to the same buffer plus 650 mM KC1. Active fractions 
were pooled. 
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Example 7: Construction ofThermotoga neapolitana 3 -to-5 'Exonuclease 
Mutant 

The amino acid sequence of portions of the Tne DNA polymerase was 
compared with other known DNA polymerases such as K coli DNA 

5 polymerase 1, Taq DNA polymerase, T5 DNA polymerase, and T7 DNA 

polymerase to localize the regions of 3'-to-5' exonuclease activity, and the dNTP 
binding domains within the DNA polymerase. One of the 3*-to-5' exonuclease 
domains was determined based on the comparison of the amino acid sequences of 
various DNA polymerases (Blanco, L., et a). Gene 112: 139-144 (1992); 

10 Braithwaite and Ito, Nucleic Acids Res. 21: 787-802 (1993)) is as follows: 



Tne 318 PSFALDLETSS 

Pol 1 350 PVFAFDTETDS 

T5 159 GPVAFDSETSA 

T7 1 MTVSDIEANA 



328 (SEQIDNO:18) 

360 (SEQIDNO:19) 

169 (SEQK)NO:20) 

10 (SEQIDNO:21) 



15 As a first step to make the Tne DNA polymerase devoid of 3'-»5' 

exonuclease activity, a 2kb Sph fragment from pSport-Tne was cloned into 
M13mpl9 (LTI, Rockville, MD). The recombinant clone was selected in E. coli 
DH5ccFIQ (LTI, Rockville, MD). One of the clones with the proper insert was 
used to isolate uracilated single-stranded DNA by infecting E. coli CJ236 (Biorad, 

20 California) with the phage particle obtained from E. coli DH5ccPIQ. An 

oligonucleotide, GA CGT TTC AAG CGC TAG GGC AAA AGA (SEQ ID 
NO:22) was used to perform site directed mutagenesis. This site-directed 
mutagenesis converted Asp 323 (indicated as * above) to Ala 323 . An £co47III 
restriction site was created as part of this mutagenesis to facilitate screening of the 

25 mutant following mutagenesis. The mutagenesis was performed using a protocol 

as described in the Biorad manual (1987) except T7 DNA polymerase was used 
instead of T4 DNA polymerase (USB, Cleveland, OH). The mutant clones were 
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screened for the Eco47Ul restriction site that was created in the mutagenic 
oligonucleotide. One of the mutants having the created Eco47lU restriction site 
was used for further study. The mutation Asp 323 to Ala 323 was confirmed by DNA 
sequencing. 

To incorporate the 3'-* 5' exonuclease mutation in an expression vector, the 
mutant phage was digested with Sphl and HindllV A 2 kb fragment containing 
the mutation was isolated. This fragment was cloned in pUC-Tne to replace the 
wild type fragment. See Figure 2A. The desired clone, pUC-Tne (3'-»5 # ), was 
isolated. The presence of the mutant sequence was confirmed by the presence of 
the unique Eco47m site. The plasmid was then digested with Sstl and HindOl. 
The entire mutant polymerase gene (2.6 kb) was purified and cloned into Sstl and 
Hindlll digested pTrc99 expression vector (Pharmacia, Sweden). The clones 
were selected in DH10B (LTI, Rockville, MD). The resulting plasmid was 
designated pTrcTne35. See Figure 2B. This clone produced active heat stable 
DNA polymerase. 

Example 8: Phenylalanine to Tyrosine Mutant 

The polymerase active site including the dNTP binding domain is usually 
present at the carboxyl terminal region of the polymerase. The sequence of the 
Tne polymerase gene suggests that the amino acids that presumably contact and 
interact with the dNTPs are present within the 694 bases starting at the internal 
BamBl site. See Figure 1. This conclusion is based on homology with a 
prototype polymerase E. coli DNA polymerase I. See Polisky et al., J. BioL 
Chem. 265:14579-14591 (1990). A comparison was made of the O-helix for 
various polymerases: 
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5 



Tne 


722 


RRVGKMVNFSIIYG 


735 


(SEQIDNO:12) 


Poll 


754 


RRSAKAINFGLIYG 


767 


(SEQIDNO:13) 


T5 


562 


RQAAKAITFGILYG 


575 


(SEQIDNO:14) 


T7 


518 


RDNAKTFIYGFLYG 


531 


(SEQIDNO:15) 


7ag 


659 


RRAAKTINFGVLYG 


672 


(SEQIDNO:16) 



In order to change Phe 730 of the Tne polymerase to a Tyr 730 site directed 
mutagenesis was performed using the oligonucleotide GTA TAT TAT AGA GTA 
GTT AAC CAT CTT TCC A (SEQ ID NO:23). As part of this oligonucleotide 
directed mutagenesis, a Hpal restriction site was created in order to screen 
10 mutants easily. The same uracilated single-stranded DNA and mutagenesis 

procedure described in Example 7 were used for this mutagenesis. Following 
mutagenesis, the mutants were screened for the Hpal site. Mutants with the 
desired Hpal site were used for further study. The mutation was confirmed by 
DNA sequencing. 

15 The Phe 730 to Tyr 730 mutation was incorporated into pUC-Tne by replacing 

the wild type Sphl -Hindi!! fragment with the mutant fragment obtained from the 
mutant phage DNA. The presence of the desired clone, pUC-TneFY, was 
confirmed by the presence of the unique Hpal site, see Figure 2A. The entire 
mutant polymerase gene was subcloned into pTrc99 as an Sstl-Hindlll fragment 

20 as described above inDHlOB. The resulting plasmid was designated pTrcTneFY. 

(Figure 2B). The clone produced active heat stable polymerase. 

Example 9: 3 '->5' Exonuclease and Phe m ->Tyr 730 Double Mutants 

In order to introduce the 3'-*5' exonuclease mutation and the 
Phe 730 ->Tyr 730 mutation in the same expression vector, pTrc99, it was necessary 
25 to first reconstitute both mutations in the pUC-Tne clone. See Figure 3 . Both the 

pUC-Tne (3'-> 5 1 ) and the pUC-TneFY were digested with BamHl. The digested 
pUC-Tne (3'-+5') was dephosphorylated to avoid recirculation in the following 
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ligations. The resulting fragments were purified on a 1% agarose gel. The largest 
Barrim fragment (4.4 kb) was purified from pUC-Tne (y->5 1 ) digested DNA and 
the smallest BarriHl fragment (0.8 kb) containing the Phe 730 -*Tyr 730 mutation was 
purified and ligated to generate pUC-Tne35FY. The proper orientation and the 
5 presence of both mutations in the same plasmid was confirmed by Eco47Ul, Hpal y 

and Sphl-Hindttl restriction digests. See Figure 3. 

The entire polymerase containing both mutations was subcloned as a Ssii- 
Hindlll fragment in pTrc99 to generate pTrcTne35FY in DH10B. The clone 
produced active heat stable polymerase. 

10 Example 10: 3' ->5 f Exonuclease, 5 '->3' Exonuclease, and Phe m ->Tyt J30 

Triple Mutants 

In most of the known polymerases, the y-to-T exonuclease activity is 
present at the amino terminal region of the polymerase (Ollis, D.L., ei al , Nature 
313, 762-766, 1985;Freemont,P.S., et al y Proteins 1,66-73, 1986; Joyce, CM., 

15 Curr. Opin. Struct Biol 1: 123-129 (1991). There are some conserved amino 

acids that are implicated to be responsible for 5'-to-3' exonuclease activity 
(Gutman and Minton, NucL Acids Res. 21, 4406-4407, 1993). See supra. It is 
known that S-to-3 1 exonuclease domain is dispensable. The best known example 
is the Klenow fragment of K coli Pol I. The Klenow fragment is a natural 

20 proteolytic fragment devoid of 5*-to-3' exonuclease activity (Joyce, CM, et al y 

J. Biol. Chem. 257, 1958-1964, 1990). In order to generate an equivalent mutant 
for Tne DNA polymerase devoid of 5-to-3' exonuclease activity, the presence of 
a unique Sphl site present 680 bases from the Sstl site was exploited. 
pUC-Tne35FY was digested with Hindm, filled-in with Klenow fragment to 

25 generate a blunt-end, and digested with Sphl. The 1.9 kb fragment was cloned 

into an expression vector pTTQl 9 (Stark, M.J.R., Gene 5 1 , 255-267, 1 987) at the 
Sphl-Smal sites and was introduced into DH10B. This cloning strategy generated 
an in-frame polymerase clone with an initiation codon for methionine from the 
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vector. The resulting clone is devoid of 219 amino terminal amino acids of Tne 
DNA polymerase. This clone is designated as pTTQTne535FY (Fig. 4). The 
clone produced active heat stable polymerase. No exonuclease activity could be 
detected in the mutant polymerase as evidenced by lack of presence of unusual 
sequence ladders in the sequencing reaction. This particular mutant polymerase 
is highly suitable for DNA sequencing. 



Example 11: 5 f ->? Exonuclease Deletion and Phe 730 ^Tyr m Substitution 
Mutant 

In order to generate the 5'-to-3* exonuclease deletion mutant of the Tne 

10 DNA polymerase Phe^^Tyr 730 mutant, the 1.8 kb Sphl-Spel fragment of 

pTTQTne535FY was replaced with the identical fragment of pUC-Tne FY. See 
Fig. 4. A resulting clone, pTTQTne5FY, produced active heat stable DNA 
polymerase. As measured by the rate of degradation of a labeled primer, this 
mutant has a modulated, low but detectable, 5'-to-3' exonuclease activity 

15 compared to wild type Tne DNA polymerase. M13/pUC Forward 23-Base 

Sequencing Primer™, obtainable from LTI, Rockville, MD, was labeled at the 5' 
end with [P 32 ] ATP and T4 kinase, also obtainable from LTI, Rockville, MD, as 
described by the manufacturer. The reaction mixtures contained 20 units of either 
wildtype or mutant Tne DNA polymerase, 0.25 pmol of labeled primer, 20 mM 

20 tricine, pH 8.7, 85 mM potassium acetate, 1.2 mM magnesium acetate, and 8% 

glycerol. Incubation was carried out at 70°C. At various time points, 10 ^il 
aliquots were removed to 5 nl cycle sequencing stop solution and were resolved 
in a 6 % polyacrylamide sequencing gel followed by andoradiography. While the 
wildtype polymerase degraded the primer in 5 to 1 5 minutes, it took the mutant 

25 polymerase more than 60 minutes for the same amount of degradation of the 

primer. 
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Example 12: Purification of the Mutant Polymerases 

The purification of the mutant polymerases was done essentially as 
described Example 6, supra, with minor modifications. Specifically, 5 to 1 0 grams 
of cells expressing cloned mutant Tne DNA polymerase were lysed by sonication 

5 with a Heat Systems Ultrasonic, Inc. Model 375 machine in a sonication buffer 

comprising 50 mM Tris-HCl (pH 7.4); 8% glycerol; 5 mM 2-mercaptoethanol, 10 
mM NaCl, 1 mM EDTA, and 0.5 mM PMSF. The sonication sample was heated 
at 75°C for 1 5 minutes. Following heat treatment, 200 mM NaCl and 0.4% PEI 
was added to remove nucleic acids. The extract was centrifuged for clarification. 

10 Ammonium sulfate was added to 48%, the pellet was resuspended in a column 

buffer consisting of 25 mM Tris-HCl (pH 7.4); 8% glycerol; 0.5% EDTA; 5 mM 
2-mercaptoethanol; 10 mM KC1 and loaded on a heparin agarose (LTI) column. 
The column was washed with 10 column volumes using the loading buffer and 
eluted with a 10 column volume buffer gradient from 10 mM to 1 M KC1. 

15 Fractions containing polymerase activity were pooled and dialyzed in column 

buffer as above with the pH adjusted to 7.8. The dialyzed pool of fractions were 
loaded onto a MonoQ (Pharmacia) column. The column was washed and eluted 
as described above for the heparin column. The active fractions are pooled and 
a unit assay was performed. 

20 The unit assay reaction mixture contained 25 mM TAPS (pH 9.3), 2 mM 

MgCl 2 , 50 mM KC1, 1 mM DTT, 0.2 mM dNTPs, 500 *ig/ml DNAse I treated 
salmon sperm DNA 21 mCi/ml [otP 32 ] dCTP and various amounts of polymerase 
in a final volume of 50 ul. After 10 minutes incubation at 70°C, 10 ul of 0.5 M 
EDTA was added to the tube. TCA perceptible counts were measured in GF/C 

25 filters using 40 ul of the reaction mixture. 
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Example 13: Generation of5'-to-3' exonuclease mutant of full length Tne 
DNA polymerase 

1, Identification of Two Amino Acids Responsible for 5M0-3' 
Exonuclease Activity 

Tne DNA polymerase contains three enzymatic activities similar to E. coli 
DNApolymerasel: S-to-3 1 DNA polymerase activity, S'-to-S 1 exonuclease activity 
and 5-to-3' exonuclease activity. This example is directed to the elimination of the 
5 ! -to-3' exonuclease activity in full length Tne DNA polymerase. Gutman and 
Minton {Nucleic Acids Res, 1993, 27, 4406-4407) identified six (A-F) conserved 
5'-to-3' exonuclease domains containing a total of 1 0 carboxylates in various DNA 
polymerases in the poll family. Seven out of 10 carboxylates (in domains A, D 
and E) have been implicated to be involved in divalent metal ions binding as 
judged from the crystal structure (Kim et al. Nature, 1995, 376, 612-616) of Taq 
DNA polymerase. However, there was no clear demonstration that these 
carboxylates are actually involved 5 f -to-3* exonuclease activity. In order to find 
out the biochemical characteristics of some of these carboxylates, two of the 
aspartic acids in domains A and E were chosen for mutagenesis. The following 
aspartic acids in these two domains were identified: 



Tne DNA polymerase: 5 FLFD 8 GT 10 (domain A) (SEQ ID NO:24) 

Taq DNA polymerase: 15 LLVD 18 GH 20 (SEQ ID NO:25) 

and 

Tne DNA polymerase: 132 SLITGD 137 KDML141 (domain E) (SEQ ID 

NO:26) 

Taq DNA polymerase: 137 RILT AD l42 KDLY 1 46 (SEQ ID NO:27) 



2, Isolation of Single Stranded DNA for Mutagenesis 



Single stranded DNA was isolated from pSportTne (see infra). pSportTne 
was introduced into DHSotFIQ (LTI, Rockville, MD) by transformation. A single 
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colony was grown in 2 ml Circle Grow (Bio 101, CA) medium with ampicillin at 
37°C for 16 hrs. A 10 ml fresh media was inoculated with 0.1 ml of the culture 
and grown at 37°C until the A590 reached approximately 0.5. At that time, 0.1 
ml of M13K07 helper phage (1X10 11 pfu/ml, LTI) was added to the culture. The 
infected culture was grown for 75 min. Kanamycin was then added at 50 |ig/ml, 
and the culture was grown overnight (16 hrs.). The culture was spun down. 9 ml 
of the supernatant was treated with 50 jig each of RNaseA and DNasel in the 
presence of 1 0 mM MgCl 2 for 30 min. at room temperature. To this mixture, 0.25 
volume of a cocktail of 3M ammonium acetate plus 20% polyethylene glycol was 
added and incubated for 20 min. on ice to precipitate phage. The phage was 
recovered by centrifugation. The phage pellet was dissolved in 200 nl of TE 
(10 mM Tris-HCl (pH 8) and 1 mM EDTA). The phage solution was extracted 
twice with equal volume of buffer saturated phenol (LTI, Rockville, MD), twice 
with equal volume of phenol:chloroform:isoamyl alcohol mixture (25:24: 1, LTI, 
Rockville, MD) and finally, twice with chloroform: isoamyl alcohol (24:1). Tothe 
aqueous layer, 0. 1 volume of 7.5 M ammonium acetate and 2.5 volume of ethanol 
were added and incubated for 15 min. at room temperature to precipitate single 
stranded DNA. The DNA was recovered by centrifugation and suspended in 
200 \xl TE. 

3. Mutagenesis of D 8 and D U7 

Two oligos were designed to mutagenize D 8 and D 137 to alanine. The 
oligos are: 5' GTAGGCCAGGGCTGTGCCGGCAAAGAGAAATAGTC 3' 
(D8A) (SEQ ID NO:28) and 5' GAAGCATATCCTTGGCGCCGGTTAT 
TATGAAAATC 3' (D137A) (SEQ ID NO:29). In the D8A oligo a NgoAIV 
(bold underlined) and in the oligo D137A a Kasl (bold underlined) site was 
created for easy identification of clones following mutagenesis. 200 pmol of each 
oligo was kinased according to the Muta-gene protocol (Bio-Rad, CA) using 5 
units of T4 Kinase (LTI, Rockville, MD). 200 ng of single stranded DNA was 
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annealed with 2 pmol of oligo according to the Muta-gene protocol. The reaction 
volume was 10 Following the annealing step, complementary DNA synthesis 
and ligation was carried out using 5 units of wildtype T7 DNA polymerase (USB, 
Ohio) and 0.5 unit T4 ligase (LTI). 1 jxl of the reaction was used to transform a 
MutS E. coli (obtainable from Dr. Paul Modrich at the Duke University, NC) and 
selected in agar plates containing ampicillin. A control annealing and synthesis 
reaction was carried out without addition of any oligo to determine the 
background. There were 50-60 fold more colonies in the transformation plates 
with the oligos than without any oligo. Six colonies from each mutagenic oligo 
directed synthesis were grown and checked for respective restriction site (NgoAlW 
or Kasl). For D8A (NgoATV), 4 out of 6 generated two fragments (3 kb and 4. 1 
kb). Since pSportTne has an NgoAIV site near the fl intergenic region, the new 
NgoAJV site within the Tne DNA polymerase produced the expected fragments. 
The plasmid was designated as pSportTneNgoAIV. For D137A (Kasl), 5 out of 
6 clones produced two expected fragments of 1.1 kb and 6 kb in size. Since 
pSportTne has another Kasl site, the newly created Kasl site generated these two 
expected fragments. The plasmid was designated as pSportTneKasI. Both D8A 
and D137A mutations were confirmed by DNA sequencing. 

4. Reconstruction of the Mutant Polymerase into Expression Vector 

During the course of expression of Tne DNA polymerase or mutant Tne 
DNA polymerase, a variety of clones were constructed. One such clone was 
designated as pTTQ Tne SeqSl. This plasmid was constructed as follows: first, 
similar to above mutagenesis technique glycine 195 was changed to an aspartic 
acid in pSportTne. A mutation in the corresponding amino acid in E. coli DNA 
polymerasel (polA2 14, domain F) was found to have lost the S'-to-S 1 exonuclease 
activity (Gutman and Minton, see above). An Sspl site was created in the mutant 
polymerase. Second, a 650 bp Ssil-Sphl fragment containing the Gl 95D mutation 
was subcloned in pUCTne35FY (see infra) to replace the wild type fragment. 
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This plasmid was called pUCTne3022. Finally, the entire mutant Tne DNA 
polymerase was subcloned from pUCTne3022 into pTTQ18 as SstVHindlW 
fragment to generate pTTQTneSeqS 1 . To introduce the mutation D8A or D 1 37A 
in this expression vector, the 650 bp Sstl-Sphl was replaced with the same Sstl- 
5 Sphl fragment from pSportTneNgoATV or pSportTneKasI. The plasmids were 

designated as pTTQTneNgo(D8A) and pTTQTneKas(D 1 3 7 A), respectively. 

5. Confirmation of the Mutations by DNA Sequencing 

DNA sequencing of both mutant polymerases confirmed the presence of 
the restriction site NgoAXV as well as the mutation D8 A; and Kas\ site as well as 
10 the mutation D137A. Also confirmed by DNA sequencing was the presence of 

the mutation D323A and the£co47IH restriction site in the exonuclease 
region. In addition, confirmed by DNA sequencing was the F730Y mutation and 
the Hpal restriction site in the O-helix region of the mutant Tne DNA polymerase. 

6. S'-to^ 1 exonuclease Activity of the Mutant Tne DNA Polymerases 

The full length mutant DNA polymerase was purified as described above. 
The 5*-to-3' exonuclease activity was determined as described in the LTI catalog. 
Briefly, 1 pmol of labeled ( 32 P) HaeXLl digested X DNA (LTI) was used for the 
assay. The buffer composition is: 25 mM Tris-HCl (pH 8.3), 5 mM MgCl 2 , 50 
mM NaCl, 0.01% gelatin. The reaction was initiated by the addition of 0, 2, 4, 6 
and 10 units of either wild type or mutant Tne DNA polymerase in a 50 \il 
reaction. The reaction mix was incubated for 1 hr at 72°C. A 10 \xl aliquot was 
subjected to PEI-cellulose thin layer chromatography and the label released was 
quantitated by liquid scintillation. In this assay, both D8A and D137A mutants 
showed less than 0.01% label release compared to the wild type Tne DNA 
polymerase. The result demonstrates that in both D8A and D137A mutants the 
5 *-to-3 f exonuclease activity has been considerably diminished. Thus, it has been 
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confirmed that these two aspartates are involved with the S'-toO 1 exonuclease 
activity. 

Example 14: Generation of double mutants, R722K/F730Y, R722Q/F730Y, 
R722H/F730Y and R 722N/F730 Y of Tne DNA polymerase 

For all mutations, the PCR method was used. A common S'-oligo, CAC 
CAG ACG GGTACC GCC ACT GGC AGG TTG (SEQ ID NO:30), was used. 
This oligo contains a Kpnl site (shown above in bold italics). The template used 
for PCR was pTTQTneSeqSl (Example 13) which already contains the F730Y 
mutation in the Tne polymerase gene. For the R722K/F730Y mutation, the oligo 
used was TAT AGA GTA GTTAAC CAT CTT TCC AAC CCG TTT CAT TTC 
TTC GAA CAC (SEQ ID NO:3 1). For the R722Q/F730Y mutation, the oligo 
used was TAT AGA GTA GTTAA C CAT CTT TCC AAC CCG TTG CAT TTC 
TTC GAA CAC (SEQ ID NO:32). For the R722N/F730Y mutation, the oligo 
used was TAT AGA GTA GTT AAC CAT CTT TCC AAC CCG GTT CAT TTC 
TTC GAA CAC (SEQ ID NO:33) and for the R722H/F730Y the oligo used was 
TAT AGA GTA GTT AAC CAT CTT TCC AAC CCG ATG CAT TTC TTC 
GAA CAC (SEQ ID NO:34). Each of these oligos contains a Hpal site (bold 
italics). The underlined codons were the mutated codons for arginine at the 
position 722 for respective amino acids. The PCR generated a 3 1 8 bp product 
containing a Kpnl and a Hpal site. The PCR products were digested with Kpnl 
and Hpal and cloned into pUC-TneFY digested with Kpnl and Hpal to replace the 
original fragment to generate pUC19TneFY-R722K, P UC19TneFY-R722Q, 
pUC19TneFY-R722H and pUC19TneFY-R722N. Finally, the KpnI-Hindlll 
fragment (~800bp) of pTTQTneKasI(D137A) was replaced by the -800 bp Kpnl- 
Hindlll fragment from these plasmids to generate pTnell (R722K/F730Y), 
pTnelO (R722Q/F730Y), pTnel3 (R722H/F730Y) and pTne9 (R722N/F739Y), 
respectively. The mutations were confirmed by DNA sequencing. 
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Example 15: Generation ofTneDNA Polymerase mutants F730A andF730S 

F730A was constructed using PCR. The forward oligo was AAG ATG 
GTT AAC GCG TCT ATA ATA TAC GG (SEQ ID NO:35) which contains a 
Hpal site and a Mlul site (bold italics). The reverse oligo was CAA GAG GCA 
CAG AGA GTT TCA CC (SEQ ID NO:36) which anneals downstream of Spel 
present in the Tne polymerase gene . The template used for PCR was pTTQTne 
KasI (D137A). The 482bp PCR product was digested with Hpal and Spel and 
cloned into pUC-TneFY thereby replacing the amino acid tyrosine at position 730 
with alanine. This construct was called pUC-Tne FA. 

F730S was constructed by site directed mutagenesis. The oligo was GTA 
TAT TAT AGA GGA GTT AAC CAT CTT TCC (SEQ IDNO:37) where ai/pal 
site was created (bold italics). The single stranded DNA used was isolated from 
pSport-Tne that contains the double mutation D 137 A and D323A. This construct 
was designated pTne 47. The Tne polymerase gene was then cloned as an Ssil and 
i/wdlll fragment into the plasmid pUC19 and the resulting clone was designated 
pTnelOl. 

Example 16: Generation of Tne DNA polymerase with a Hpal site in front of 
the amino acid phenylalanine at position 730. 

A construct of Tne polymerase was made using PCR where a Hpal 
restriction enzyme site was introduced into the gene in front of the amino acid 
phenylalanine at position 730. The forward oligonucleotide was AAG ATG GTT 
AAC11C TCT ATA ATA TAC GG (SEQ ID NO:3 8) which contains *Hpa\ site 
(shown above in bold italics) and the reverse oligo was the same as in Example 1 5 
above. The template used for PCR was pTne33 which contains the Tne 
polymerase gene with D137A and D323A mutations cloned in pUC19. The 
482bp PCR product was digested with Hpal and Spel and was used to replace the 
corresponding fragment in pTnelOl (see example 15). The construct was 
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sequenced to verify that the amino acid at position 730 was indeed phenylalanine 
and the plasmid was numbered pTnel06. 

Example 17: Generation of double mutants R722Y/F730A and R722UF730A 
of the Tne DNA polymerase. 

For both the mutations PCR method was used. The common 5' oligo was 
the same as in Example 1 4. For R722Y/F730A mutation the oligo used was TAT 
AGA GTA GTTAAC CAT CTT TCC AAC CCG GTA CATGTC TTC GTT 
CAC (SEQ ID NO:39). For R722L/F730A mutation the oligo used was TAT 
AGA GTA GTTAAC CAT CTT TCC AAC CCG CAA CATGTC TTC GTT 
CAC (SEQ ID NO:40). Each of these oligos contain a Hpal site (shown above 
in bold italics). The underlined codons were the mutated codons for arginine at the 
position 722 for respective amino acids. An Aflttl site was also created (shown 
above in bold italics next to the underlined codon) in order to confirm the 
mutation. The PCR generated a 3 1 8 bp product containing a Kpnl and nHpal site. 
The PCR products were digested with Kpnl and Hpal and cloned into pUC- 
TneFA (see example 15). The constructs were named as pUCTneYA and 
pUCTneLA. 

Example 18: Generation of Tne DNA Polymerase mutants R722Y 
andR722U 

The plasmid pTne 1 06 (see example 1 6) was digested with Hpal and Kpnl 
and the 318 bp fragment was replaced with the corresponding fragment from 
pUCTneYA or pUCTneLA (see Example 17) to generate the mutants R722Y or 
R722L. In these constructs the amino acid at position 730 is the same as wild type 
Tne (phenylalanine). The constructs were sequenced to confirm the R722Y and 
the R722L mutations. The Tne DNA polymerase gene was then cloned as a 
SstUHindUl fragment into the plasmid pSportl . 
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Example 19: Generation of Tne DNA Polymerase mutants R722K, R722Q 
andR722H. 

The construct pTne 106 (see example 16) was digested with Hpal and 
Kpnl and the 3 1 8 bp fragment was replaced with the corresponding fragment from 
5 the construct P UC19TneFY-R722K, P UC19TneFY-R722H or pTnelO (see 

Example 14), to generate the mutants R722K, R722H and R722Q. The constructs 
were sequenced to confirm the mutations. The Tne DNA polymerase gene was 
then subcloned into the vector pSportl as a SsWHindUl fragment. 



Example 20: Purification of the mutant Tne DNA Polymerases 

10 The purification of the mutants of Tne DNA polymerase was carried out 

based on the method described above with minor modifications. Two to three 
grams of cells expressing cloned mutant Tne DNA polymerase were resuspended 
in 15-20 ml of sonication buffer (50 mM Tris-HCl , pH 8.0, 10% glycerol, 5mM 
2-mercaptoethanol, 50 mM NaCl, 1 mM EDTA, and 0.5 mM PMSF and 

15 sonicated with a 550 Sonic Dismembrator (Fisher Scientific). The sonicated 

sample was heated at 82°C for 20 min and then cooled in ice-water for 5 min. 
In the sample, 20 mM NaCl and 0.2% PEI were added and centrifuged at 13,000 
rmp for 1 0 min. Ammonium sulfate (305g/L) was added to the supernatant. The 
pellet was collected by centrifugation and resuspended in 4 ml of MonoQ column 

20 buffer (50 mM Tris-HCl, pH 8 .0, 1 0% glycerol, 5mM 2-mercaptoethanol, 50 mM 

NaCl, and 1 mM EDTA). The sample was dialyzed against one litter of MonoQ 
buffer overnight. Following the centrifugation at 13,000 rpm to remove any 
insoluble materials, the sample was loaded onto a MonoQ column (HR5/5, 
Pharmacia). The column was washed with MonoQ column buffer to baseline of 

25 OD^ and then eluted with a linear gradient of 50-300 mM NaCl in 20 ml MonoQ 

column buffer. The fractions were analyzed by 8% SDS-PAGE and the Tne DNA 



. WO 98/35060 



PCT/US98/02791 



-73- 

polymerase activity determined as described earlier. The fractions containing 
active and pure Tne DNA polymerase were pooled. 

Example 21: Generation of Taq DNA Polymerase Mutants R659K, R659H 
andR659Y 

5 A 2.5 kb portion of the gene encoding Taq DNA polymerase (Figure 5) 

was cloned as a Hind UhXba I fragment into M13mpl9. Site directed 
mutagenesis was performed using the BioRad mutagene kit (BioRad California) 
using the following oligonucleotides: 

CTTGGCCGCCCG4J(?CATCAGGGGGTC (SEQ ID NO:41) for the 

10 R659H mutation where an Nsil site was created (see bold italics); 

CTTGGCCGCCCGCTrC4 TGA GGGGGTCC AC (SEQ ID N0.42) for 
the R659K mutation where aAspHI site was created (see bold italics); and 

CTTGGCCGCCC TGTA CA TC AGGGGGTC (SEQ ID NO:43) for the 
R659Y mutation where aBsrGl site was created (see bold italics). 

15 For each mutation, six clones were screened by analyzing the M13RF 

DNA for the expected restriction sites. Mutations were confirmed by DNA 
sequencing. DNA shown to contain the mutation by the presence of the expected 
restriction site was digested with NgoAIV and Xba I and the approximately 1 600 
base pair fragment was used to replace corresponding fragment in the wildtype 

20 Taq DNA polymerase gene. These constructs were made in a plasmid containing 

Taq polymerase gene under the control of Tac promoter (pTTQ Taq) to generate 
pTTQ Taq (R659K), pTTQ Taq (R659H) and pTTQ Taq (R659Y). These 
plasmids were transformed into E. coli DH10B (LTI). 
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Example 22: Construction of Tne polymerase mutants containing F730S and 
F730T 

Single stranded DNA was isolated from pSportTne (Tne35) containing D137A 
and D323A mutations as described in the section 2 of example 13 . These D137 

5 and D323A mutations rendered Tne DNA polymerase devoid of S'-exonuclease 

and 3'-to-5'-exonuclease activities, respectively. Thus, Tne 35 is devoid of both 
exonuclease activities. The site-directed mutagenesis was done following the 
protocol decribed in section 3 ofExample 1 3. The oligos used were 5 1 GTA TAT 
TAT AGA GGA GTT AAC CAT CTT TCC 3 f (SEQ ID NO:37) for F730S and 

10 5' GTA TAT TAT AGA GGT GTT AAC CAT CTT TCC 3' (SEQ ID NO:44) 

for F730T. Each of these two oligos contain a diagonistic Hpal site for 
screening of mutants in the MutS strain. The mutant plasmids were transferred 
to DH1 OB strains. The mutations were finally confirmed by DNA sequencing. 
The mutant polymerases were purified by the procedure as described in Example 

15 20.. 

Example 23: Determination of the Activity of Non-templated One Base 
Addition for Tne and Taq DNA Polymerase by Primer 
Extension Assay 

The following 34-mer primer was 52 P labeled at the 5* end with [y- 32 P] 
20 ATP and T4 polynucleotide kinase by standard protocol (Molecular Cloning, A 

Laboratory Manual, Cold Spring Harbor, NY): 

S'-GGGAGACCGGAATTCTCCTTCATTAATTCCTATA^' (SEQ 1DN0:45) 

The unincorporated ATP was removed by a BioRad P6 column(1.0 ml). The 
labeled primer was annealed to the following homogenous (purified) 48-mer 
25 template: 
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5'-TGGAGACCCTGGAACTATAGGAATTAATGAAGGAGAATTCCGGT 
CTCCC-3' (SEQ ID NO:46). 

Wildtype or mutant DNA polymerases (0. 125-1 .0 unit) were incubated at 
72° C for 2 min in 20 mM Tris-HCl (pH8.3), 1 .5 mM MgCl* 50 mM KC1, 1.0 

5 mM DTT, 200 uM of dCTP, dGTP, TTP, dATP, and 0.02 pmol of the annealed 

primer-template. After addition of sequencing stop buffer and heated at 90°C for 
2 min, the mixture was loaded onto 1 0% polyacrylamide-7 M urea. Following the 
electrophoresis, the gel was dried and the reaction products were analyzed by 
autoradiography. The non-templated one base addition products shown in Figure 

10 6 were quantified by a Phosphorlmager (Molecular Dynamics). 



Tne DNA polymerases % of N+l 

1 D137A 185 

2 D137A D323A 78.5 
15 3 D137A D323A R722K 0.7 

4 D137A D323A R722Y 0.7 

5 D137A D323A R722L . 5.7 

6 D137A D323A R722H 12 
20 7 D137A D323A R722Q 1.4 

8 D137A D323A F730Y 61.3 

9 D137A D323A R722K F730Y 6.8 

10 D137A D323A R722H F730Y 2.1 

11 D137A D323A R722Q F730Y 6.1 
25 12 D137A D323A R722N F730Y 15.9 

13 D137A D323A F730S 8.3 

14 D137A D323A F730T 24.2 
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Taq DNA Polymerases 0/0 of N+1 

1 W.T. 37 

2 R659K 1-4 

3 R659Y 09 

4 R659H 05 

5 F667Y 39.1 



Example 24: Comparison of DNA Synthesis by Taq and Tne 

To examine its propensity to add a nontemplated nucleotides to the 3' 
termini of PCR products, Tne DNA polymerase (S'exo*, 3'exo) was compared 

10 side-by-side with Taq DNA polymerase in amplifications of short tandem repeats 

at 23 different marker loci (see Table 1). Reactions comprising 20 mM TRIS- 
HC1, pH 8.4, 50 mM KC1, 1.5 mM MgCl* 200 mM each dNTP, 200 nM [ 32 P] 
a-dATP, 200 nM each of the upper and lower primers, 25 ng of human DNA, 
0.1% nonionic detergent and 1 unit of DNA polymerase (in a volume of 25 ml) 

15 were assembled on ice. Published sequences for upper and lower primers for each 

locus, as shown in Table 1, were used for all amplifications. 

Reactions were loaded into a Perkin Elmer model 9600 thermocycler 
preheated to 94°C and PCR was done using standard cycling conditions (1 minute 
pre-denaturation at 94°C; 30 cycles of 30 seconds at 94°C, 30 seconds at 55°C, 

20 and 1 minute at 72°C; 1 minute post-extension at 72°C; overnight soak at 4°C). 

A portion of each reaction was mixed with an equal volume of 95% formamide 
containing dyes to indicate the progress of electrophoresis. Samples were heated 
to 90°C for 2 min, and 5 ml of each was loaded on a 6% denaturing 
polyacrylamide gel. Sequencing ladders were loaded to provide size markers, and 

25 electrophoresis was performed at 70 watts. After electrophoresis the gel was 

transferred to filter paper and dried. Autoradiography and phosphoimage analysis 
was performed to visualize the PCR products and estimate the percentage of 
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product which contained the added nucleotide by direct comparison of bands 
produced by each enzyme. 

Examples of the side-by-side comparisons of amplification products 
produced by Taq DNA polymerase and Tne DNA polymerase are shown in 
5 Figure 7 for the CD4 locus and in Figure 8 for the D20S27 locus. At both of 

these loci, a significant portion of the Taq PCR product contained an extra non- 
templated nucleotide (n+1), while Tne polymerase demonstrated no apparent non- 
templated nucleotide incorporation for either the CD4 locus (Figure 7) or the 
D20S27 locus (Figure 8). Complete results for the 23 marker loci examined are 
10 summarized in Table 2. In PCRs using Taq DNA polymerase, a portion of the 

amplification product contained an extra non-templated nucleotide (n+1) at every 
locus examined. In PCRs using Tne DNA polymerase, however, no detectable 
portion of the product at any of the loci examined contained an additional non- 
templated nucleotide. These results indicate that Tne DNA polymerase, in 
15 contrast to Taq DNA polymerase, is substantially reduced in the ability to add a 

nontemplated 3 1 terminal nucleotide to the growing strand. Since the Tne DNA 
polymerase used in these amplifications was a 3'exo- mutant (i.e., it was 
substantially reduced in 3' exonuclease activity), these results are consistent with 
the notion that the Tne polymerase was unable to add the extra nucleotide to the 
20 product rather than adding the nucleotide and then removing it via a 3' 

exonuclease activity. 
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Table 2. Non-templated 3 f Terminal Nucleotide Addition by Taq and Tne 
DNA Polymerases at 23 Microsatellite DNA Loci, 



Locus 


Kepeat type 


M Utf \ SO IItI^ 


7W r% n+l) 


D13S7I 


ainucieotiue 


100 


n 

V 


D1S103 


ainucieotiae 


75 inn 


n 

V 


D15S87 


dinucleotide 




n 


D2S136 


dinucleotide 


on inn 


n 


HUMCD4 


pentanucleotide 




n 


HUMPLA2A 


trinucleotide 




o 


D19S49 


dinucleotide 


75 




D4S175 


QmuCieOuue 


75 


o 


ArUUZAC 


uinucieouue 


50 


o 


DzUaz / 




50 


o 


Dl JOlZI 




100 


o 


TSAQ1QQ 




50 


o 


AT>fV"") 


UlllUWrlCUUUv 


50 


0 


D10S89 


dinucleotide 


75-100 


0 


HUMVWA 


tetranucleotide 


90 


0 


D16S401 


dinucleotide 


100 


0 


D7S440 


dinucleotide 


90 


0 j 


D4S174 


dinucleotide 


75 


0 


D16S520 


dinucleotide 


100 


0 


D16S511 


dinucleotide 


100 


0 


HUMD21S11 


tetranucleotide 


100 


0 


HUMTHOl 


tetranucleotide 


75 


0 


HUMACTBP2 


tetranucleotide 


25 


0 
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Example 25: Comparison ofDNA Synthesis by Tne and Other Thermostable 
Enzymes 

To further evaluate the differences in the propensities of Tne and other 
thermostable DNA polymerases to add non-templated 3' terminal nucleotides to 
PCR products, side-by-side amplifications were performed using a single marker 
locus Dl SI 03 and a variety of thermostable enzymes, including 3' exonuclease 
deficient (3'exo-) enzymes, and 3' exonuclease competent (3'exo+) enzymes. PCR 
amplifications, electrophoresis and analysis were performed as described for 
Example 24, using 200 nM of DlS103-specific upper and lower primers. 

Results for the amplifications using 3'exo- DNA polymerases are shown 
in Table 3. With the exception of 7>ie(3'exo-), all of the 3'exo- DNA polymerases 
examined exhibited a propensity to add a non-templated 3' terminal nucleotide 
(n+1) to the PCR product. For Taq and Tbr DNA polymerases, up to 100% of 
the PCR products contained an additional non-templated 3' terminal nucleotide, 
while Vent, Deep Vent, and Dtok 3'exo- mutants polymerases added this non- 
templated nucleotide to 25-100% of the PCR products! In contrast, the 3'exo- 
mutant of Tne DNA polymerase was substantially reduced in the ability to add a 
nontemplated 3 1 terminal nucleotide to the DNA molecule; none of the PCR 
products from reactions using 7he(3'exo-) had an additional non-templated 
nucleotide at their 3' termini. 

Results from amplifications using 3'exo+ DNA polymerases are shown in 
Table 4. Five polymerases were examined as well as two commercially available 
enzyme mixes (mixtures of a primary 3'exo- polymerase and a secondary 3'exo+ 
polymerase). At this locus, the 3'exo+ DNA polymerases (Tne, Tma y Pfu, Pwo 
and 9°North) yielded product which did not contain an extra non-templated 
nucleotide. The enzyme mixtures (Elongase and Expand HiFi) yielded a mixture 
of products with and without an additional non-templated nucleotide. Together, 
these results indicate that Tne polymerases, whether 3'exo- or 3*exo+, are 
substantially reduced in the ability to add a nontemplated 3' terminal nucleotide 
to the DNA molecule. Moreover, of the preferred 3'exo- polymerases, only 
7>n?(3'exo-) was substantially reduced in this activity, indicating its favorableness 
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in PCR applications where non-templated nucleotide addition to the amplification 
product is undesirable. 

Table 3. Non-templated 3* Terminal Nucleotide Addition by3'exo- 
5 DNA Polymerases. 



Enzyme 


n Sized Fragment 


n+1 Sized Fragment 


7M3'exo-) 


+ 




Taq 




+ 


Ventft'exo-) 


+ ; 


+ 


Deep Vent(3'exo-) 


+ 


+ 


Dtok(3'exo-) 


+ 


+ 


Thermolase7fcr 




+ 


Tablet Non-templated 3' Terminal Nucleotide Addition by 3'exo+ 
DNA Polymerases. 


Enzyme 


n Sized Fragment 


n+1 Sized Fragment 


rne(3 , exo+) 


+ 




\JiTma 


+ 




Pfu 






Pwo 


+ 




9°North 


+ 




Elongase 


+ 


+ 


Expand HiFi 


+ 


+ 



Example 26: Comparison of DNA Synthesis by Tne Mutants 



25 



To examine the utility of Tne DNA polymerase and various mutants 
thereof in amplification of microsatellite DNA sequences, the experiments 
described in Example 25 were repeated with 1 1 different Tne DNA polymerase 
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mutants. Of these mutants, 3 were 5*exo+, while the remainder were 5'exo- either 
due to N-terminal deletions of the protein, or to point mutations in the 5' 
exonuclease domain of the polymerase. 

As shown in Table 5, use of the 5*exo- Tne mutants resulted in productive 
5 amplifications, yielding PCR products with no non-templated 3 1 terminal 

nucleotide additions. Results were identical for all seven 7he(3'exo-/5'exo-) 
polymerase mutants, as well as for the single rwe(3'exo+/5'exo-) mutant tested. 
Results with 5'exo+ Tne mutants were inconclusive under the conditions tested. 

These results indicate that the mutants of Tne DNA polymerase tested in 
10 the present studies are substantially reduced in the ability to add nontemplated 3 1 

terminal nucleotides to the growing strand, particularly a DNA template 
comprising a microsatellite DNA sequence or an STR. 



Table 5. Non-templated 3' Terminal Nucleotide Addition by Tne DNA 
Polymerase Mutants 



Enzyme 


5'exo 
Activity 


3'exo 
Activity 


n Sized 
Fragment 


n+1 Sized 
Fragment 


TneN'A219, D323A 






+ 




TneN'A283,D323A 






+ 




TneN'A192, D323A 










TneD137A, D323A 






+ 




Tne D8A, D323A 






+ 




Tne G195D, D323A 






+ 




Tne G37D, D323A 






+ 




Tne N*A283 




+ 


+ 





98/35060 



PCT/US98/02791 



-83- 

Example 27: Fluorescent Analysis of DNA Synthesis by Tne and Taq DNA 
Polymerases 

In an alternative analysis approach, the propensities of Taq DNA 
polymerase and Trie DNA polymerase to add non-templated nucleotides to the 
PCR products were compared using fluorescent detection. The polymerases were 
compared in side-by-side amplifications utilizing a commonly used commercially 
available marker panel (ABI Prism Linkage Mapping Set Panel 21), examining ten 
different loci. Reaction mixtures (15 ml) containing 1.5 mM MgClj, 250 mM of 
each deoxynucleoside triphosphate, 333 nM of each primer, 50 ng of human DNA 
and 0.6 units of Taq or Tne DNA polymerase were assembled on ice. Reactions 
were loaded into a Perkin Elmer model 9600 thermocycler preheated to 95°C, and 
PCR was performed using recommended cycling conditions (5 minutes pre- 
denaturation at 95°C; 10 cycles of 15 seconds at 95°C, 15 seconds at 55°C, and 
60 seconds at 72°C; and 20 cycles of 1 5 seconds at 89°C, 1 5 seconds at 55°C, and 
60 seconds at 72°C). Two sets of extension reactions were conducted for each 
locus, one with a 10 minute post-extension incubation at 72°C followed by an 
overnight soak and storage at 4°C (conditions which favor nontemplated 3* 
nucleotide addition), the other with no post-extension incubation followed by 
immediate storage at -20°C (conditions which inhibit nontemplated 3 f nucleotide 
addition). A portion of each reaction was diluted, mixed with loading cocktail, 
heat denatured and loaded on an 8% polyacrylamide sequencing gel. The ABI 
373 Stretch Automated Sequencer was run for 5-6 hours at 15W in order to 
obtain single base resolution, and data were analyzed using GeneScan software. 
Areas of the peaks recognized by the software were used to estimate the 
percentage of nontemplated 3* nucleotide addition ("n+r) for each locus by the 
two polymerases under the two different extension conditions. The total area 
under the allelic peaks was used to compare the yields of specific PCR product 
obtained in Tne and Taq amplifications, and yields produced by Tne polymerase 
were expressed for each locus as a percentage of those produced by Taq 
polymerase. Table 6 summarizes the results obtained. 
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Table 6. Comparison of DNA Amplification by Tag and Tne DNA 
Polymerases by Fluorescent Detection 





locus 


color 


expected 
size 


cycling 
conditions 


Taq 
pattern 


Tne 

pattern 


Tne 

yield (% 
Taq) 


5 


D16S40 
5 


blue 


107-145 


no final ext 
10' final 
ext 


0% n+1 
94% n+1 


100% n 
100% n 


89% 

178% \ 




D15S12 

/ 


green 


114-148 


no final ext 

1H' final 

ext 


53% n+1 
100% 

n+1 


100% n 
100% n 


133% 
142% 




D16S52 
u 


yellow 


144-160 


no final ext 

IV IlUal 

ext 


40% n+1 

100% 

n+1 


98% n 
100% n 


275% 
252% 


10 


D16S51 
1 


green 


182-222 


no final ext 
10* final 
ext 


62% n+ 1 

100% 

n+1 


100% n 
100% n 


51% 
160% 




Til £CA1 

1 


DIUc 




nn final pvt 

10' final 
ext 

10' final 

ext 


0% n+1 
64% n+1 
67% n+1 


100% n 
100% n 
95% n 


218% 
257% 
305% 


15 


D15S13 
1 


yellow 


237-275 


no final ext 
10' final 
ext 


0% n+1 
69% n+1 


100% n 
95% n 


48% 
231% 




D15S13 
0 


blue 


280-294 


no final ext 


0% n+1 


100% n 


101% 




D16S50 
3 


yellow 


294-310 


no final ext 
10' final 
ext 


0% n+1 
73% n+1 


100% n 
100% n 


102% 
166% 


20 


D15S11 
7 


green 


316-334 


no final ext 
10' final 
ext 


17% n+1 
77% n+1 


100% n 
100% n 


130% 
326% 




D16S51 
5 


blue 


320-350 


no final ext 
10' final 
ext 


32% n+ 1 

100% 

n+1 


100% n 
100% n 


486% 
298% 
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The results shown in Table 6 confirm that under conditions favoring 
("10 f final ext") or inhibiting ("no final ext M ) 3' nontemplated nucleotide addition, 
Tne DNA polymerase produced PCR products that were 95-1 00% free from non- 
templated nucleotide addition (V) for each locus examined. Tag DNA 
polymerase, however, demonstrated significant addition of nontemplated 
nucleotides under inhibiting conditions in most loci tested, while under permissive 
conditions well over half, and in some cases all, of the PCR product produced by 
Taq DNA polymerase demonstrated an additional nontemplated 3' nucleotide. 
Furthermore, under most conditions the amount of PCR product yielded by Tne 
DNA polymerase was at least as high as that of Taq DNA polymerase, and for 
some loci was 3- to 4-fold higher. 

Figure 9 shows two examples of electropherogram gel scans, aligned by 
PCR product size, comparing the PCR products obtained with Taq and Tne 
polymerases with a 10-minute final extension. For the D15S153 locus, Taq 
exhibited non-templated nucleotide addition to 40% of the PCR product (Figure 
39), while Tne exhibited no such addition of non-templated nucleotides (Figure 
9B). Similar results were obtained with the D15S127 locus: 53% of the 7i7? PCR 
products demonstrated non-templated nucleotide addition (Figure 9C), while none 
of the Tne PCR products demonstrated non-templated nucleotide addition (Figure 
9D). These results demonstrate the difficulty in identifying alleles in a 
heterogeneous pattern as generated by Taq amplification, compared to the more 
homogeneous, simple pattern generated by amplification with Tne. 

Together with Examples 24-26, these results indicate that Tne DNA 
polymerase and the mutants thereof tested in the present studies are substantially 
reduced in the ability to add a nontemplated 3" terminal nucleotide to DNA 
templates, particularly DNA templates comprising microsatellite DNA sequences 
or STRs. Conversely, Taq DNA polymerase demonstrates significant addition of 
nontemplated 3' nucleotides to PCR products. 
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Example 28: Comparison ofTaq and Tne 

To examine the ability of a truncated form of Tne DNA polymerase 
(ISTA283, 5'exo-, 10% 3'exo activity) to add a nucleotide to the end of the PCR 
product, the enzyme was compared side-by-side with wild type Taq DNA 
polymerase in amplifications of short tandem repeats at 5 different marker loci. 
A portion of ABI Prism Linkage Mapping Set Panel 21 was used for the primer 
sets for the loci. 15 ul reactions (20 mM Tris-HCl, pH8.4, SOmMKCl, 1.5 mM 
MgCl 2 , 200 uM each dNTP, 333 nM each primer, 60 ng human DNA, 0.1% 
nonionic detergent, 0.6 U DNA polymerase) were assembled on ice. 

Reactions were loaded into a Perkin Elmer model 9600 thermocycler 
preheated to 95 °C and PCR was done using recommended cycling conditions (5 
min. pre-denaturation at 95°C; 10 cycles of 15 sec at 95°C, 15 sec at 55 °C, and 
60 sec at 72 °C; 20 cycles of 1 5 sec at 89°C, 15 sec at 55 °C, and 60 sec at 72°C; 
lOmin final extension at 72°C). A portion of each reaction was diluted, mixed 
with loading cocktail, heat denatured and loaded on a 8% sequencing gel. The 
ABI 373 Stretch Automated Sequencer was run for 5-6hr at 15W in order to 
obtain 1 base resolution. Data was analyzed using GeneScan software. Areas of 
the peaks recognized by the software were used to estimate the percent of 
extranucleotide addition. Table 7 summarizes the results obtained. Examples of 
the electropherogram data is shown in Figure 10. 



Table 7: Percent extranucleotide addition exhibited by Taq and Tne 
DNA polymerases at specific loci. 



Locus 


Taq(% n+1) 


Tne(% n+1) 


D16S405 


46 


0 


D16S401 


100 


45 


D16S520 


63 


0 


D15S131 


51 


0 


D16S411 


53 


0 
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Example 29: Comparison ofTne Mutants 

In order to evaluate the effect of amino acid substitutions in Tne DNA 
polymerase in regard to extra nucleotide addition, different mutations at position 
F730 in the untruncated polymerase were compared in side-by-side amplifications 
with 7i^(wild type) and a truncated Tne(tf&219, D323A, F730Y) utilizing a 
portion of ABI Prism Linkage Mapping Set Panel 21 . Six loci were examined. 
15 ul reactions (20 mM Tris-HCl, pH 8.4, 50 mM KC1, 1.5 mM MgCl 2 , 200 uM 
each dNTP, 333 nM each primer, 50-60 ng human DNA, 0.1% nonionic 
detergent, 0. 15-0.6 U DNA polymerase) were assembled on ice. 

Reactions were loaded into a Perkin Elmer model 9600 thermocycler 
preheated to 95 °C and PCR was done using recommended cycling conditions (5 
min. pre-denaturation at 95°C; 10 cycles of 15 sec at 95°C, 1 5 sec at 55 °C, and 
60 sec at 72 °C; 20 cycles of 15 sec at 89 °C, 1 5 sec at 55 °C, and 60 sec at 72°C; 
lOmin final extension at 72°C). A portion of each reaction was diluted, mixed 
with loading cocktail, heat denatured and loaded on a 8% sequencing gel. The 
ABI 373 Stretch Automated Sequencer was run for 5-6hr at 15W in order to 
obtain 1 base resolution. Data was analyzed using GeneScan software. Areas of 
the peaks recognized by the software were used to estimate the percent of 
extranucleotide addition. Table 8 summarizes the results obtained. An example 
of the electropherogram data is shown in Figure 1 1 . 
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Example 30: Comparison of Tne and Taq Mutants 

In order to evaluate the effect of amino acid substitution at position F667 
in Taq DNA polymerase(equivalent to F730 in Tne DNA polymerase) in regard 
to extra nucleotide addition, a commercially available mutant of Taq DNA 

5 polymerase (Taq FS) (NA3, G46D, F667Y) was compared in side-by-side 

amplifications with Taq DNA polymerase(wild type) and Tne-1 DNA 
polymerase(N'A219, D323A, F730Y). Three loci were examined (a portion of 
ABI Prism Linkage Mapping Set Panel 21). 1 5 ul reactions (20 mM Tris-HCl, pH 
8.4, 50 mM KC1, 1.5 mM MgCl 2 , 200 uM each dNTP, 333 nM each primer, 60 

10 ng human DNA, 0.1% nonionic detergent, 0.6 U DNA polymerase) were 

assembled on ice. 

Reactions were loaded into a Perkin Elmer model 9600 thermocycler 
preheated to 95 °C and PCR was done using recommended cycling conditions (5 
min. pre-denaturation at 95°C; 10 cycles of 15 sec at 95 °C, 15 sec at 55 °C, and 

15 60 sec at 72 °C; 20 cycles of 1 5 sec at 89 °C, 15 sec at 55 °C, and 60 sec at 72 °C; 

lOmin final extension at 72 °C). A portion of each reaction was diluted, mixed 
with loading cocktail, heat denatured and loaded on a 8% sequencing gel The 
ABI 373 Stretch Automated Sequencer was run for 5-6hr at 15W in order to 
obtain lbase resolution. Data was analyzed using GeneScan software. Areas of 

20 the peaks recognized by the software were used to estimate the percent of 

extranucleotide addition. Table 9 summarizes the results obtained. Examples of 
the electropherogram data are shown in Figure 12. 
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Tablc 9: Percent extranucleotide addition exhibited by Taq and Tne 
DNA polymerases at specific loci* 



locus 


Taq(% n+1) 


TaqFS(% n+1) 


Tne-1(% 
n+1) 


D16S4H 


48 


0 


0 


D15S127 


100 


31 


5 


D15S153 


100 


29 


0 



Example 31: Comparison of Tne Mutants 

In order to evaluate the effect of amino acid substitutions at position R722 
in Tne DNA polymerase in regard to extranucleotide addition, different mutations 

10 in the polymerase were compared in side-by-side amplifications utilizing a portion 

of ABI Prism Linkage Mapping Set Panel 21. Six loci were examined. 15 ul 
reactions (20 mM Tris-HCl, pH 8.4, 50 mM KC1, 1 .5 mM MgCl 2 , 200 uM each 
dNTP, 333 nM each primer, 50-60 ng human DNA, 0. 1 % nonionic detergent, 0.2- 
0.6 U DNA polymerase) were assembled on ice. 

15 Reactions were loaded into a Perkin Elmer model 9600 thermocycler 

preheated to 95 °C and PCR was done using recommended cycling conditions (5 
min.pre-denaturationat95°C; 10 cycles of 15 sec at 95 °C, 15 sec at 55 °C, and 
60 sec at 72°C; 20 cycles of 15 sec at 89°C, 15 sec at 55 °C, and 60 sec at 72 °C; 
lOmin final extension at 72 °C). A portion of each reaction was diluted, mixed 

20 with loading cocktail, heat denatured and loaded on a 8% sequencing gel. The 

ABI 373 Stretch Automated Sequencer was run for 5-6hr at 15W in order to 
obtain lbase resolution. Data was analyzed using GeneScan software. Heights 
of the n and n+1 peaks recognized by the software were used to estimate the 
percent of extranucleotide addition. Table 1 0 summarizes the results obtained. An 

25 example of the electropherogram data is shown in Figure 1 3 . 
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Example 32: Generation of Tne DNA Polymerase Mutant K726R 

The mutation of the Tne polymerase was done by essentially the same 
procedure as described above in Example 13. The single-stranded DNA was 
isolated from pSport-Tne containing D137A and D323A mutations. The 
5 oligonucleotide used for the mutagenesis was 5*-GAA GTT C AC CAT CCG GCC 

GAC CCG TCG CAT TTC 3* (SEQ ID NO:93). An XmalU site (bold italics in 
the above sequence) was introduced into the oligonucleotide for easy screening 
of the mutants. The mutation was confirmed by DNA sequencing. The clone was 
named pTnel29 (D137A, D323A, K726R). 

10 Example 33: Determination of the Activity of Non-templated One Base 

Addition for Tne DNA Polymerase and its Mutant D137A, 
D323A y K726R, by Primer Extension Assay 

The mutant Tne DNA polymerase {Tne D137A, D323A, K726R) 
prepared in Example 32 was purified as described in Example 20. The assay for 
15 non-templated one base addition was conducted as described in Example 23 . The 

results were as follows: 

Tne DNA Polymerase % of Product With N+l 

D137A,D323A 78.4 
D137A,D323A,R722H 1.7 
20 D137A, D323A, K726R 0.9 

These results demonstrate that mutation of the lysine residue at position 
726 of Tne, particularly to arginine, substantially reduces the activity of the 
polymerase in adding non-templated bases. 

Having now folly described the present invention in some detail by way of 
25 illustration and example for purposes of clarity of understanding, it will be obvious 
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to one of ordinary skill in the art that the same can be performed by modifying or 
changing the invention within a wide and equivalent range of conditions, 
formulations and other parameters without affecting the scope of the invention or 
any specific embodiment thereof, and that such modifications or changes are 

5 intended to be encompassed within the scope of the appended claims. 

All publications, patents and patent applications mentioned in this 
specification are indicative of the level of skill of those skilled in the art to which 
this invention pertains, and are herein incorporated by reference to the same extent 
as if each individual publication, patent or patent application was specifically and 

10 individually indicated to be incorporated by reference. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Life Technologies, Inc. 

(B) STREET: 9800 Medical Center Drive 

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 20850 

(ii) TITLE OF INVENTION: Polymerases for Analyzing or Typing Polymorphic 
Nucleic Acid Fragments and Uses Thereof 

(iii) NUMBER OF SEQUENCES: 93 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (BPO) 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: (To be assigned) 

(B) FILING DATE: (Herewith) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US (To be assigned) 

(B) FILING DATE: 06 -JAN- 199 8 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/037,393 

(B) FILING DATE: 07-FEB-1997 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
ATGGCGAGAC TATTTCTCTT TGATGGCACA GCCCTGGCCT ACAGGGCATA TTACGCCCTC 60 
GACAGATCCC TTTCCACATC CACAGGAATT CCAACGAACG CCGTCTATGG CGTTGCCAGG 120 
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ATGCTCGTTA AATTCATTAA GGAACACATT ATACCCGAAA AGGACTACGC GGCTGTGGCC 180 

TTCGACAAGA AGGCAGCGAC GTTCAGACAC AAACTGCTCG TAAGCGACAA GGCGCAAAGG 240 

CCAAAGACTC CGGCTCTTCT AGTTCAGCAG CTACCTTACA TCAAGCGGCT GATAGAAGCT 300 

CTTGGTTTCA AAGTGCTGGA GCTGGAGGGA TACGAAGCAG ACGATATCAT CGCCACGCTT 360 

GCAGTCAGGG CTGCACGTTT TTTGATGAGA TTTTCATTAA TAACCGGTGA CAAGGATATG 420 

CTTCAACTTG TAAACGAGAA GATAAAGGTC TGGAGAATCG TCAAGGGGAT ATCGGATCTT . 480 

GAGCTTTACG ATTCGAAAAA GGTGAAAGAA AGATACGGTG TGGAACCACA TCAGATACCG 540 

GATCTTCTAG CACTGACGGG AGACGACATA GACAACATTC CCGGTGTAAC GGGAATAGGT 600 

GAAAAGACCG CTGTACAGCT TCTCGGCAAG TATAGAAATC TTGAATACAT TCTGGAGCAT 660 

GCCCGTGAAC TCCCCCAGAG AGTGAGAAAG GCTCTCTTGA GAGACAGGGA AGTTGCCATC 720 

CTCAGTAAAA AACTTGCAAC TCTGGTGACG AACGCACCTG TTGAAGTGGA CTGGGAAGAG 780 

ATGAAATACA GAGGATACGA CAAGAGAAAA CTACTTCCGA TATTGAAAGA ACTGGAGTTT 840 

GCTTCCATCA TGAAGGAACT TCAACTGTAC GAAGAAGCAG AACCCACCGG ATACGAAATC 900 

GTGAAGGATC ATAAGACCTT CGAAGATCTC ATCGAAAAGC TGAAGGAGGT TCCATCTTTT 960 

GCCCTGGACC TTGAAACGTC CTCCCTTGAC CCGTTCAACT GTGAGATAGT CGGCATCTCC 1020 

GTGTCGTTCA AACCGAAAAC AGCTTATTAC ATTCCACTTC ATCACAGAAA CGCCCAGAAT 1080 

CTTGATGAAA CACTGGTGCT GTCGAAGTTG AAAGAGATCC TCGAAGACCC GTCTTCGAAG 1140 

ATTGTGGGTC AGAACCTGAA GTACGACTAC AAGGTTCTTA TGGTAAAGGG TATATCGCCA 1200 

GTTTATCCGC ATTTTGACAC GATGATAGCT GCATATTTGC TGGAGCCAAA CGAGAAAAAA 1260 

TTCAATCTCG AAGATCTGTC TTTGAAATTT CTCGGATACA AAATGACGTC TTATCAGGAA 1320 

CTGATGTCGT TTTCCTCACC ACTTTTTGGT TTCAGCTTTG CGGATGTTCC GGTAGACAAG 1380 

GCTGCGAACT ACTCCTGCGA GGATGCAGAC ATCACTTATA GGCTCTACAA GATACTCAGC 1440 

ATGAAGCTCC ATGAAGCGGA ACTTGAGAAC GTCTTCTACA GGATAGAGAT GCCGTTGGTG 1500 

AACGTTCTTG CACGCATGGA ATTGAACGGG GTGTATGTGG ACACAGAATT CCTGAAAAAG 1560 

CTCTCGGAGG AGTACGGCAA AAAGCTCGAG GAACTGGCCG AAAAAATCTA CCAGATAGCA 1620 

GGTGAGCCCT TCAACATCAA TTCTCCAAAA CAGGTTTCAA AGATCCTTTT TGAGAAGCTG 1680 

GGAATAAAAC CCCGTGGAAA AACGACAAAA ACAGGAGAGT ACTCTACCAG GATAGAGGTG 174 0 

TTGGAAGAGA TAGCGAATGA GCACGAGATA GTACCCCTCA TTCTCGAGTA CAGAAAGATC 1800 
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CAGAAACTGA AATCGACCTA CATAGACACC CTTCCGAAAC TTGTGAACCC GAAAACCGGA 1860 

AGAATTCATG CATCTTTCCA CCAGACGGGT ACCGCCACTG GCAGGTTGAG TAGCAGTGAT 1920 

CCAAATCTTC AGAATCTTCC GACAAAGAGC GAAGAGGGAA AAGAAATTAG AAAAGCGATT 1980 

GTGCCCCAGG ATCCAGACTG GTGGATCGTC AGTGCGGATT ATTCCCAAAT AGAACTCAGA 2040 

ATCCTCGCTC ATCTCAGTGG TGATGAGAAC CTTGTGAAGG CCTTCGAGGA GGGCATCGAT 2100 

GTGCACACCT TGACTGCCTC CAGGATCTAC AACGTAAAGC CAGAAGAAGT GAACGAAGAA 2160 

ATGCGACGGG TTGGAAAGAT GGTGAACTTC TCTATAATAT ACGGTGTCAC ACCGTACGGT 2220 

CTTTCTGTGA GACTTGGAAT ACCGGTTAAA GAAGCAGAAA AGATGATTAT CAGCTATTTC 2280 

ACACTGTATC CAAAGGTGCG AAGCTACATC CAGCAGGTTG TTGCAGAGGC AAAAGAGAAG 2340 

GGCTACGTCA GGACTCTCTT TGGAAGAAAA AGAGATATTC CCCAGCTCAT GGCAAGGGAC 2400 

AAGAACACCC AGTCCGAAGG CGAAAGAATC GCAATAAACA CCCCCATTCA GGGAACGGCG 2460 

GCAGATATAA TAAAATTGGC TATGATAGAT ATAGACGAGG AGCTGAGAAA AAGAAACATG 2520 

AAATCCAGAA TGATCATTCA GGTTCATGAC GAACTGGTCT TCGAGGTTCC CGATGAGGAA 2580 
AAAGAAGAAC TAGTTGATCT GGTGAAGAAC AAAATGACAA ATGTGGTGAA ACTCTCTGTG . 2640 

CCTCTTGAGG TTGACATAAG CATCGGAAAA AGCTGGTCTT GA 2682 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 893 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 
1^5 10 15 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly lie Pro Thr 
20 25 30 

Asn Ala Val Tyr Gly Val Ala Arg Met Leu Val Lys Phe He Lys Glu 
35 40 45 
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His lie He Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lys 
50 55 60 

Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Asp Lys Ala Gin Arg 
65 70 75 80 

Pro Lys Thr Pro Ala Leu Leu Val Gin Gin Leu Pro Tyr He Lys Arg 

85 90 95 

Leu He Glu Ala Leu Gly Phe Lys Val Leu Glu Leu Glu Gly Tyr Glu 
100 105 HO 

Ala Asp Asp He He Ala Thr Leu Ala Val Arg Ala Ala Arg Phe Leu 
115 120 125 

Met Arg Phe Ser Leu He Thr Gly Asp Lys Asp Met Leu Gin Leu Val 
. 130 135 140 

Asn Glu Lys He Lys Val Trp Arg He Val Lys Gly He Ser Asp Leu 
145 150 155 160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 
165 170 175 

His Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp He Asp Asn 
180 185 190 

He Pro Gly Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 

Gly Lys Tyr Arg Asn Leu Glu Tyr He Leu Glu His Ala Arg Glu Leu 
210 215 220 

Pro Gin Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala He 
225 230 235 240 

Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 
245 '250 255 

Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
260 265 270 

Pro He Leu Lys Glu Leu Glu Phe Ala Ser He Met Lys Glu Leu Gin 
275 280 285 

Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu He Val LyB Asp His 
290 295 300 

Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys Glu Val Pro Ser Phe 
305 310 315 320 

Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu He 
325 330 335 
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Val Gly He Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 
340 345 350 

Leu His His Arg Asn Ala Gin Asn Leu Asp Glu Thr Leu Val Leu Ser 
355 360 365 

Lys Leu Lys Glu He Leu Glu Asp Pro Ser Ser Lys He Val Gly Gin 
370 375 380 

Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly He Ser Pro 
385 390 395 400 

Val Tyr Pro His Phe Asp Thr Met He Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 440 445 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Asn Tyr 
450 455 460 

Ser Cys Glu Asp Ala Asp He Thr Tyr Arg Leu Tyr Lys He Leu Ser 
465 470 475 480 

Met Lys Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg lie Glu 
485 490 495 

Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Leu Asn Gly Val Tyr 
500 505 510 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 520 525 

Leu Glu Glu Leu Ala Glu Lys He Tyr Gin He Ala Gly Glu Pro Phe 
530 535 540 

Asn He Asn Ser Pro Lys Gin Val Ser Lys He Leu Phe Glu Lys Leu 
545 550 555 560 

Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Glu Tyr Ser Thr 
565 570 575 

Arg He Glu Val Leu Glu Glu He Ala Asn Glu His Glu He Val Pro 
580 585 590 

Leu He Leu Glu Tyr Arg Lys He Gin Lys Leu Lys Ser Thr Tyr He 
595 600 605 

Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg He His Ala 
610 615 620 

Ser Phe His Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
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625 630 635 640 

Pro Asn Leu Gin Asn Leu Pro Thr Lye Ser Glu Glu Gly Lys Glu He 
645 650 655 

Arg Lys Ala He Val Pro Gin Asp Pro Asp Trp Trp He Val Ser Ala 
660 665 670 

Asp Tyr Ser Gin He Glu Leu Arg He Leu Ala His Leu Ser Gly Asp 
675 680 685 

Glu Asn Leu Val Lys Ala Phe Glu Glu Gly He Asp Val His Thr Leu 
690 695 700 

Thr Ala Ser Arg He Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
705 710 715 720 

Met Arg Arg Val Gly Lys Met Val Asn Phe Ser He He Tyr Gly Val 
725 730 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly He Pro Val Lys Glu Ala 
740 745 750 

Glu Lys Met He He Ser Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
755 760 765 

Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 780 

Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 
785 790 795 800 

Lys Asn Thr Gin Ser Glu Gly Glu Arg He Ala He Asn Thr Pro He 
805 810 815 

Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Asp He Asp 
820 825 630 

Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met He He Gin Val 
835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 855 860 

Val Asp Leu Val Lye Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 
865 870 875 880 

Pro Leu Glu Val Asp He Ser He Gly Lys Ser Trp Ser 
885 890 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 677 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



<ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

5 Met Ser Leu His Ala Arg Glu Leu Pro Gin Arg Val Arg Lys Ala Leu 

15 10 15 

Leu Arg Asp Arg Glu Val Ala He Leu Ser Lys Lys Leu Ala Thr Leu 
20 25 30 

Val Thr Asn Ala Pro Val Glu Val Asp Trp Glu Glu Met Lys Tyr Arg 
10 35 40 45 

Gly Tyr Asp Lys Arg Lys Leu Leu Pro He Leu Lys Glu Leu Glu Phe 
50 55 60 

Ala Ser He Met Lys Glu lieu Gin Leu Tyr Glu Glu Ala Glu Pro Thr 
65 70 75 80 

15 Gly Tyr Glu He Val Lys Asp His Lys Thr Phe Glu Asp Leu He Glu 

85 90 95 

Lys Leu Lys Glu Val Pro Ser Phe Ala Leu Ala Leu Glu Thr Ser Ser 
100 105 HO 

Leu Asp Pro Phe Asn Cys Glu He Val Gly He Ser Val Ser Phe Lys 
20 115 120 125 

Pro Lys Thr Ala Tyr Tyr He Pro Leu His His Arg Asn Ala Gin Asn 
130 135 140 

Leu Asp Glu Thr Leu Val Leu Ser Lys Leu Lys Glu He Leu Glu Asp 
145 150 155 160 

25 Pro Ser Ser Lys He Val Gly Gin Asn Leu Lys Tyr Asp Tyr Lys Val 

165 170 175 

Leu Met Val Lys Gly He Ser Pro Val Tyr Pro His Phe Asp Thr Met 
180 185 190 

He Ala Ala Tyr Leu Leu Glu Pro Asn Glu Lys Lys Phe Asn Leu Glu 
30 195 200 205 

Asp Leu Ser Leu Lys Phe Leu Gly Tyr Lys Met Thr Ser Tyr Gin Glu 
210 215 220 

Leu Met Ser Phe Ser Ser Pro Leu Ph Gly Phe Ser Phe Ala Asp Val 
225 230 235 240 
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Pro Val Asp Lys Ala Ala Asn Tyr Ser Cys Glu Asp Ala Asp He Thr 
245 250 255 

Tyr Arg Leu Tyr Lys He Leu Ser Met Lys Leu His Glu Ala Glu Leu 
260 265 270 

Glu Asn Val Phe Tyr Arg He Glu Met Pro Leu Val Asn Val Leu Ala 
275 280 285 

Arg Met Glu Leu Asn Gly Val Tyr Val Asp Thr Glu Phe Leu Lys Lys 
290 295 300 

Leu Ser Glu Glu Tyr Gly Lys Lys Leu Glu Glu Leu Ala Glu Lys He 
305 310 315 320 

Tyx Gin He Ala Gly Glu Pro Phe Asn He Asn Ser Pro Lys Gin Val 
325 330 335 

Ser Lys He Leu Phe Glu Lys Leu Gly He Lys Pro Arg Gly Lys Thr 
340 345 350 

Thr Lys Thr Gly Glu Tyr Ser Thr Arg He Glu Val Leu Glu Glu He 
355 360 365 

Ala Asn Glu His Glu He Val Pro Leu He Leu Glu Tyr Arg Lys He 
370 375 380 

Gin Lys Leu Lys Ser Thr Tyr He Asp Thr Leu Pro Lys Leu Val .Asn 

395 400 



385 



390 



Pro Lys Thr Gly Arg He His Ala Ser Phe His Gin Thr Gly Thr Ala 
405 410 415 

Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn Leu Pro Thr 
420 425 430 

Lys Ser Glu Glu Gly Lys Glu He Arg Lys Ala He Val Pro Gin Asp 
435 440 445 

Pro Asp Trp Trp He Val Ser Ala Asp Tyr Ser Gin He Glu Leu Arg 
450 455 460 

He Leu Ala His Leu Ser Gly Asp Glu Asn Leu Val Lys Ala Phe Glu 
465 470 475 480 

Glu Gly He Asp Val His Thr Leu Thr Ala Ser Arg He Tyr Asn Val 
485 490 495 

Lys Pro Glu Glu Val Asn Glu Glu Met Arg Arg Val Gly Lys Met Val 
500 505 510 

Asn Phe Ser He He Tyr Gly Val Thr Pro Tyr Gly Leu Ser Val Arg 
515 520 525 

Leu Gly He Pro Val Lys Glu Ala Glu Lys Met He He Ser Tyr Phe 
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530 535 540 

Thr Leu Tyr Pro Lys Val Arg Ser Tyr He Gin Gin Val Val Ala Glu 
545 550 555 560 

Ala Lys Glu Lys Gly Tyr Val Arg Thr Leu Phe Gly Arg Lys Arg Asp 
5 565 570 575 

He Pro Gin Leu Met Ala Arg Asp Lys Asn Thr Gin Ser Glu Gly Glu 
580 585 590 

Arg He Ala He Asn Thr Pro lie Gin Gly Thr Ala Ala Asp He He 
595 600 605 

10 Lys Leu Ala Met He Asp He Asp Glu Glu Leu Arg Lys Arg Asn Met 

610 615 620 

Lys Ser Arg Met He He Gin Val His Asp Glu Leu Val Phe Glu Val 
625 630 635 640 

Pro Asp Glu Glu Lys Glu Glu Leu Val Asp Leu Val Lys Asn Lys Met 
15 645 650 655 

Thr Asn Val Val Lys Leu Ser Val Pro Leu Glu Val Asp He Ser He 
660 665 670 

Gly Lys Ser Trp Ser 
675 

20 (2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 
25 (D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Lys Glu Leu Gin Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu 
15 10 15 

30 He Val Lys Asp His Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys 

20 25 30 

Glu Val Pro Ser Phe Ala Leu Ala Leu Glu Thr Ser Ser Leu Asp Pro 
35 40 45 

Phe Asn Cys Glu He Val Gly He Ser Val Ser Phe Lys Pro Lys Thr 
35 50 55 60 
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Ala Tyr Tyr He Pro Leu His His Arg Asn Ala Gin Asn Leu Asp Glu 
65 70 75 80 

Thr Leu Val Leu Ser Lys Leu Lys Glu He Leu Glu Asp Pro Ser Ser 
85 90 95 

5 Lys He Val Gly Gin Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val 

100 105 HO 

Lys Gly He Ser Pro Val Tyr Pro His Phe Asp Thr Met He Ala Ala 
115 120 125 

Tyr Leu Leu Glu Pro Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser 
10 130 135 140 

Leu Lys Phe Leu Gly Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser 
145 150 155 160 

Phe Ser Ser Pro Leu Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp 
165 170 175 

15 Lys Ala Ala Asn Tyr Ser Cys Glu Asp Ala Asp lie Thr Tyr Arg Leu 

180 185 190 

Tyr Lys He Leu Ser Met Lys Leu His Glu Ala Glu Leu Glu Asn Val 
195 200 205 

Phe Tyr Arg He Glu Met Pro Leu Val Asn Val Leu Ala Arg Met Glu 
20 210 215 220 

Leu Asn Gly Val Tyr Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu 
225 230 235 240 

Glu Tyr Gly Lys Lys Leu Glu Glu Leu Ala Glu Lys He Tyr Gin He 
245 250 255 

25 Ala Gly Glu Pro Phe Asn He Asn Ser Pro Lys Gin Val Ser Lys He 

260 265. 270 

Leu Phe Glu Lys Leu Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr 
275 280 285 

Gly Glu Tyr Ser Thr Arg He Glu Val Leu Glu Glu He Ala Asn Glu 
30 290 295 300 

His Glu He Val Pro Leu He Leu Glu Tyr Arg Lys He Gin Lys Leu 
305 310 315 320 

Lys Ser Thr Tyr He Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr 
325 330 335 

35 Gly Arg He His Ala Ser Phe His Gin Thr Gly Thr Ala Thr Gly Arg 

340 345 350 

Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn Leu Pro Thr LyB Ser Glu 
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355 360 365 

Glu Gly Lys Glu He Arg Lye Ala He Val Pro Gin Asp Pro Asp Trp 
370 375 380 

Trp He Val Ser Ala Asp Tyr Ser Gin He Glu Leu Arg He Leu Ala 
385 390 395 400 

His Leu Ser Gly Asp Glu Asn Leu Val Lys Ala Phe Glu Glu Gly He 
405 410 415 

ABp Val His Thr Leu Thr Ala Ser Arg He Tyr Asn Val Lys Pro Glu 
420 425 430 

Glu Val Asn Glu Glu Met Arg Arg Val Gly Lys Met Val Asn Phe Ser 
435 440 445 



He He Tyr Gly Val Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly He 
450 455 460 

Pro Val Lys Glu Ala Glu Lys Met He He Ser Tyr Phe Thr Leu Tyr 
465 470 475 480 

Pro Lys Val Arg Ser Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu 
485 490 495 

Lys Gly Tyr Val Arg Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin 
500 505 510 

Leu Met Ala Arg Asp Lys Asn Thr Gin Ser Glu Gly Glu Arg He Ala 
515 520 525 

He Asn Thr Pro He Gin Gly Thr Ala Ala Asp He He Lys Leu Ala 
530 535 540 

Met He Asp He Asp Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg 
545 550 555 560 

Met He He Gin Val His Asp Glu Leu Val Phe Glu Val Pro Asp Glu 
565 570 575 

Glu Lys Glu Glu Leu Val Asp Leu Val Lys Asn Lys Met Thr Asn Val 
580 585 590 

Val Lys Leu Ser Val Pro Leu Glu Val Asp He Ser He Gly Lys Ser 
595 600 605 

Trp Ser 
610 

(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Asn Ser Ser Ser Val Pro He Pro Gly Val Thr Gly He Gly Glu 
15 10 15 

Lys Thr Ala Val Gin Leu Leu Gly Lys Tyr Arg Asn Leu Glu Tyr He 
20 25 30 



Leu Glu His Ala Arg Glu Leu Pro Gin Arg Val Arg Lys Ala Leu Leu 
35 40 45 

Arg Asp Arg Glu Val Ala He Leu Ser Lys Lys Leu Ala Thr Leu Val 
50 55 60 

Thr Asn Ala Pro Val Glu Val Asp Trp Glu Glu Met Lys Tyr Arg Gly 
65 70 75 80 

Tyr Asp Lys Arg Lys Leu Leu Pro He Leu Lys Glu Leu Glu Phe Ala 
85 90 ' 95 

Ser He Met Lys Glu Leu Gin Leu Tyr Glu Glu Ala Glu Pro Thr Gly 
100 105 HO 

Tyr Glu He Val Lys Asp His Lys Thr Phe Glu Asp Leu He Glu Lys 
115 120 125 

Leu Lys Glu Val Pro Ser Phe Ala Leu Ala Leu Glu Thr Ser Ser Leu 
130 135 140 

Asp Pro Phe Asn Cys Glu He Val Gly He Ser Val Ser Phe Lys Pro 
145 150 155 160 

Lys Thr Ala Tyr Tyr He Pro Leu His His Arg Asn Ala Gin Asn Leu 
165 170 175 

Asp Glu Thr Leu Val Leu Ser Lys Leu Lys Glu He Leu Glu Asp Pro 
180 185 190 

Ser Ser Lys He Val Gly Gin Asn Leu Lys Tyr Asp Tyr Lys Val Leu 
195 200 205 

Met Val Lys Gly He Ser Pro Val Tyr Pro His Phe Asp Thr Met He 
210 215 220 

Ala Ala Tyr Leu Leu Glu Pro Asn Glu Lys Lys Phe Asn Leu Glu Asp 
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225 230 235 240 

Leu Ser Leu Lys Phe Leu Gly Tyr Lys Met Thr Ser Tyr Gin Glu Leu 
245 250 255 

Met Ser Phe Ser Ser Pro Leu Phe Gly Phe Ser Phe Ala Asp Val Pro 
260 265 270 

Val Asp Lys Ala Ala Asn Tyr Ser Cys Glu Asp Ala Asp He Thr Tyr 
275 280 285 

Arg Leu Tyr Lys He Leu Ser Met Lys Leu His Glu Ala Glu Leu Glu 
290 295 300 

Asn Val Phe Tyr Arg He Glu Met Pro Leu Val Asn Val Leu Ala Arg 
305 310 315 320 



Met Glu Leu Asn Gly Val Tyr Val Asp Thr Glu Phe Leu Lys Lys Leu 
325 330 335 

Ser Glu Glu Tyr Gly Lys Lys Leu Glu Glu Leu Ala Glu Lys He Tyr 
340 " 345 350 

Gin He Ala Gly Glu Pro Phe Asn He Asn Ser Pro Lys Gin Val Ser 
355 360 365 

Lys He Leu Phe Glu Lys Leu Gly He Lys Pro Arg Gly Lys Thr Thr 
370 375 380 ' 

Lys Thr Gly Glu Tyr Ser Thr Arg He Glu Val Leu Glu Glu He Ala 
385 390 395 400 

Asn Glu His Glu He Val Pro Leu He Leu Glu Tyr Arg Lys He Gin 
405 410 415 

Lys Leu Lys Ser Thr Tyr He Asp Thr Leu Pro Lys Leu Val Asn Pro 
420 425 430 

Lys Thr Gly Arg He His Ala Ser Phe His Gin Thr Gly Thr Ala Thr 
435 440 445 

Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn Leu Pro Thr Lys 
450 455 460 

Ser Glu Glu Gly Lys Glu He Arg LyB Ala He Val Pro Gin Asp Pro 
465 " 470 475 480 

Asp Trp Trp He Val Ser Ala Asp Tyr Ser Gin He Glu Leu Arg He 
485 490 495 

Leu Ala His Leu Ser Gly Asp Glu Asn Leu Val Lys Ala Phe Glu Glu 
500 505 510 

Gly He Asp Val His Thr Leu Thr Ala Ser Arg II Tyr Asn Val Lys 
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515 



520 



525 



10 



Pro Glu Glu Val Asn Glu Glu Met Arg Arg Val Gly Lys Met Val Asn 
530 535 540 

Phe Ser He He Tyr Gly Val Thr Pro Tyr Gly Leu Ser Val Arg Leu 
545 550 555 560 

Gly He Pro Val Lys Glu Ala Glu Lys Met He He Ser Tyr Phe Thr 
565 570 575 

Leu Tyr Pro Lys Val Arg Ser Tyr He Gin Gin Val Val Ala Glu Ala 
580 585 590 

Lys Glu Lys Gly Tyr Val Arg Thr Leu Phe Gly Arg Lys Arg Asp He 
595 600 605 



Pro Gin Leu Met Ala Arg Asp Lys Asn Thr Gin Ser Glu Gly Glu Arg 
610 615 620 

He Ala He Asn Thr Pro He Gin Gly Thr Ala Ala Asp He He Lys 
15 625 630 635 640 

Leu Ala Met He Asp He Asp Glu Glu Leu Arg Lys Arg Asn Met Lys 
645 650 655 

Ser Arg Met He He Gin Val His Asp Glu Leu Val Phe Glu Val Pro 
660 665 670 

20 Asp Glu Glu Lys Glu Glu Leu Val Asp Leu Val Lys Asn LyB Met Thr 

675 680 685 

Asn Val Val Lys Leu Ser Val Pro Leu Glu Val Asp He Ser He Gly 
690 695 700 



Lys Ser Trp Ser 
25 705 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 893 amino acids 
(B) TYPE: amino acid 
30 <C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 
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15 10 15 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly He Pro Thr 
20 25 30 

Aen Ala Val Tyr Gly Val Ala Arg Met Leu Val Lys Phe He Lys Glu 
35 40 45 

His He He Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lys 
50 55 60 

Ala Ala Thr Phe Arg His Lys lieu Leu Val Ser Asp Lys Ala Gin Arg 
65 70 75 80 

Pro Lys Thr Pro Ala Leu Leu Val Gin Gin Leu Pro Tyr He Lys Arg 
85 90 95 



Leu He Glu Ala Leu Gly Phe Lys Val Leu Glu Leu Glu Gly Tyr Glu 
100 105 HO 

Ala Asp Asp He He Ala Thr Leu Ala Val Arg Ala Ala Arg Phe Leu 
115 120 125 

Met Arg Phe Ser Leu He Thr Gly Ala Lys Asp Met Leu Gin Leu Val 
130 135 140 

Asn Glu LyB He Lys Val Trp Arg He Val Lys Gly He Ser Asp Leu 
145 150 155 160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 
165 170 175 

His Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp He Asp Asn 
180 185 190 

He Pro Gly Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 

Gly Lys Tyr Arg Asn Leu Glu Tyr He Leu Glu His Ala Arg Glu Leu 
210 215 220 

Pro Gin Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala He 
225 230 235 240 

Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 
245 250 255 

Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
260 265 270 

Pro He Leu Lys Glu Leu Glu Phe Ala Ser He Met Lys Glu Leu Gin 
275 280 285 

Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu He Val Lys Asp His 
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290 295 300 

Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys Glu Val Pro Ser Phe 
305 310 315 320 

Ala Leu Ala Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu He 
325 330 335 

Val Gly He Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 
340 345 350 

Leu His His Arg Asn Ala Gin Asn Leu Asp Glu Thr Leu Val Leu Ser 
355 360 365 

Lys Leu Lys Glu He Leu Glu Asp Pro Ser Ser Lys He Val Gly Gin 
370 375 380 



Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly He Ser Pro 
385 390 395 400 

Val Tyr Pro His Phe Asp Thr Met He Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 44*0 445 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Asn Tyr 
450 455 460 

Ser Cys Glu Asp Ala Asp He Thr Tyr Arg Leu Tyr Lys He Leu Ser 
465 470 475 480 

Met Lys Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg He Glu 
485 490 495 

Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Leu Asn Gly Val Tyr 
500 505 510 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 520 525 

Leu Glu Glu Leu Ala Glu Lys He Tyr Gin He Ala Gly Glu Pro Phe 
530 535 540 

Asn He Asn Ser Pro Lys Gin Val Ser Lys He Leu Phe Glu Lys Leu 
545 550 555 560 

Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Glu Tyr Ser Thr 
565 570 575 

Arg He Glu Val Leu Glu Glu He Ala Asn Glu His Glu He Val Pro 
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580 585 590 

Leu lie Leu Glu Tyr Arg Lys lie Gin Lys Leu Lys Ser Thr Tyr He 
595 600 605 

Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg He His Ala 
610 615 620 

Ser Phe His Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
625 630 635 640 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu He 
645 650 655 

Arg Lys Ala He Val Pro Gin Asp Pro Asp Trp Trp He Val Ser Ala 
660 665 670 



Asp Tyr Ser Gin He Glu Leu Arg He Leu Ala His Leu Ser Gly Asp 
675 680 685 

Glu Asn Leu Val Lys Ala Phe Glu Glu Gly He Asp Val His Thr Leu 
690 695 700 

Thr Ala Ser Arg He Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
705 710 715 720 

Met Arg Arg Val Gly Lys Met Val Asn Phe Ser He He Tyr Gly Val 
725 730 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly He Pro Val Lys Glu Ala 
740 745 750 

Glu Lys Met He He Ser Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
755 760 765 

Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 780 

Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 
785 790 795 800 

Lys Asn Thr Gin Ser Glu Gly Glu Arg He Ala He Asn Thr Pro He 
805 810 815 

Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Asp He Asp 
820 825 830 

Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met lie He Gin Val 
835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 855 860 

Val Asp Leu Val Lys Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 
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865 870 875 880 

Pro Leu Glu Val Asp He Ser He Gly Lys Ser Trp Ser 
885 890 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 893 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Ala Arg Leu Phe Leu Phe Ala Gly Thr Ala Leu Ala Tyr Arg Ala 
1 5 10 15 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly He Pro Thr 
20 25 30 

Asn Ala Val Tyr Gly Val Ala Arg Met Leu Val Lys Phe He Lys Glu 
35 40 45 

His He He Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lys 
50 55 60 

Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Asp Lys Ala Gin Arg 
65 70 75 80 

Pro Lys Thr Pro Ala Leu Leu Val Gin Gin Leu Pro Tyr He Lys Arg 
85 90 95 

Leu He Glu Ala Leu Gly Phe Lys Val Leu Glu Leu Glu Gly Tyr Glu 
100 105 HO 

Ala Asp Asp He He Ala Thr Leu Ala Val Arg Ala Ala Arg Phe Leu 
115 120 125 

Met Arg Phe Ser Leu He Thr Gly Asp Lys Asp Met Leu Gin Leu Val 
130 135 140 

Asn Glu Lys He Lys Val Trp Arg He Val Lys Gly He Ser Asp Leu 
145 150 155 160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 
165 170 175 

His Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp He Asp ABn 
180 185 190 



- WO 98/35060 



PCT/US98/02791 



112 



He Pro Gly Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 

Gly Lys Tyr Arg Asn Leu Glu Tyr He Leu Glu His Ala Arg Glu Leu 
210 215 220 

5 Pro Gin Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala He 

225 ^ 230 235 240 

Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 
245 250 255 

Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
10 260 265 270 

Pro He Leu Lys Glu Leu Glu Phe Ala Ser He Met Lys Glu Leu Gin 
275 280 285 

Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu He Val Lys Asp His 
290 295 300 

15 Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys Glu Val Pro Ser Phe 

305 310 315 320 

Ala Leu Ala Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu He 
325 330 335 

Val Gly He. Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 
20 340 345 350 

Leu His His Arg Asn Ala Gin Asn Leu Asp Glu Thr Leu Val Leu Ser 
355 360 365 

Lys Leu Lys Glu He Leu Glu Asp Pro Ser Ser Lys He Val Gly Gin 
370 375 380 

25 Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly He Ser Pro 

385 ~ 390 395 400 

Val Tyr Pro His Phe Asp Thr Met He Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
30 420 425 430 

Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 440 445 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Asn Tyr 
450 455 460 

35 Ser Cys Glu Asp Ala Asp He Thr Tyr Arg Leu Tyr Lys He Leu Ser 

465 470 475 480 

Met Lys Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg He Glu 
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485 490 495 

Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Leu Asn Gly Val Tyr 
500 505 510 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
5 515 520 525 

Leu Glu Glu Leu Ala Glu Lys He Tyr Gin He Ala Gly Glu Pro Phe 
530 535 540 

Asn He Asn Ser Pro Lys Gin Val Ser Lys He Leu Phe Glu Lys Leu 
545 550 555 560 

10 Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Glu Tyr Ser Thr 

565 570 575 



Arg He Glu Val Leu Glu Glu He Ala Asn Glu His Glu He Val Pro 
580 585 590 

Leu He Leu Glu Tyr Arg Lys He Gin Lys Leu Lys Ser Thr Tyr He 
15 595 600 605 

Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg He His Ala 
610 615 620 

Ser Phe His- Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
625 630 635 640 

20 Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu lie 

645 650 655 

Arg Lys Ala He Val Pro Gin Asp Pro Asp Trp Trp He Val Ser Ala 
660 665 670 

Asp Tyr Ser Gin He Glu Leu Arg He Leu Ala His Leu Ser Gly Asp 
25 675 680 685 

Glu Asn Leu Val Lys Ala Phe Glu Glu Gly He Asp Val His Thr Leu 
690 695 700 

Thr Ala Ser Arg He Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
705 " 710 715 720 

30 Met Arg Arg Val Gly Lys Met Val Asn Phe Ser He He Tyr Gly Val 

725 730 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly He Pro Val Lys Glu Ala 
740 745 750 

Glu Lys Met II He S r Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
35 755 760 765 

Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
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770 775 780 

Thr Leu Phe Gly Arg Lys Arg Asp lie Pro Gin Leu Met Ala Arg Asp 
785 790 795 800 

Lys Asn Thr Gin Ser Glu Gly Glu Arg lie Ala He Asn Thr Pro He 
805 810 815 

Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Asp He Asp 
820 825 830 

Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met He He Gin Val 
835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 855 860 



Val Asp Leu Val Lys Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 
865 870 875 880 

Pro Leu Glu Val Asp He Ser He Gly Lys Ser Trp Ser 
885 890 

(2) INFORMATION FOR SEQ ID NO: 8: 

<i) SEQUENCE CHARACTERISTICS: 

. (A) LENGTH: 893 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 
15 10 15 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly He Pro Thr 
20 25 30 

Asn Ala Val Tyr Gly Val Ala Arg Met Leu Val Lys Phe He Lys Glu 
35 40 45 

His He He Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lys 
50 55 60 

Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Asp Lys Ala Gin Arg 
65 70 75 80 

Pro Lys Thr Pro Ala Leu Leu Val Gin Gin Leu Pro Tyr He Lys Arg 
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85 



90 



95 



Leu He Glu Ala Leu Gly Phe Lys Val Leu Glu Leu Glu Gly Tyr Glu 

100 105 HO 

Ala Asp Asp He He Ala Thr Leu Ala Val Arg Ala Ala Arg Phe Leu 

115 120 125 

Met Arg Phe Ser Leu He Thr Gly Asp Lys Asp Met Leu Gin Leu Val 

130 135 140 

Asn Glu Lys He Lys Val Tip Arg He Val Lys Gly He Ser Asp Leu 

145 150 155 160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 



His Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp He Asp Asn 
180 185 190 

He Pro Asp Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 

Gly Lys Tyr Arg Asn Leu Glu Tyr He Leu Glu His Ala Arg Glu Leu 
210 215 220 

Pro Gin Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala He 
225 230 235 240 

Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 
245 250 255 

Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
260 265 270 

Pro He Leu Lys Glu Leu Glu Phe Ala Ser He Met Lys Glu Leu Gin 
275 280 285 . 

Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu He Val Lys Asp HiB 
290 295 300 

Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys Glu Val Pro Ser Phe 
305 310 315 320 

Ala Leu Ala Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu He 
325 330 335 

Val Gly He Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 
340 345 350 

Leu His His Arg Asn Ala Gin Asn Leu Asp Glu Thr Leu Val L u Ser 
355 360 365 

Lys Leu Lys Glu He Leu Glu Asp Pro Ser Ser Lys He Val Gly Gin 



165 



170 



175 
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370 



375 



380 



Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly He Ser Pro 
385 390 395 400 

Val Tyr Pro His Phe Asp Thr Met He Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lye Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 440 445 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Asn Tyr 
450 455 460 



Ser Cys Glu Asp Ala Asp He Thr Tyr Arg Leu Tyr Lys He Leu Ser 
465 470 475 480 

Met Lys Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg He Glu 



485 



490 



495 



Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Leu Asn Gly Val Tyr 
500 505 510 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys LyB 
515 520 525 

Leu Glu Glu Leu Ala Glu Lys He Tyr Gin He Ala Gly Glu Pro Phe 
530 535 540 

Asn He Asn Ser Pro Lys Gin Val Ser Lys He Leu Phe Glu Lys Leu 
545 550 555 560 

Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Glu Tyr Ser Thr 
565 .570 575 

Arg lie v Glu Val Leu Glu Glu He Ala Asn Glu His Glu He Val Pro 
580 585 590 

Leu He Leu Glu Tyr Arg Lys He Gin Lys Leu Lys Ser Thr Tyr He 
595 600 605 

Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg He His Ala 
610 615 620 

Ser Phe His Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
625 630 635 640 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu He 
645 650 655 

Arg Lys Ala He Val Pro Gin Asp Pro Asp Trp Trp He Val Ser Ala 
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660 



665 



670 



Asp Tyr Ser Gin lie Glu Leu Arg lie Leu Ala His Leu Ser Gly Asp 
675 680 685 

Glu Asn Leu Val Lys Ala Phe Glu Glu Gly lie Asp Val His Thr Leu 
690 695 700 

Thr Ala Ser Arg lie Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
705 710 715 720 

Met Arg Arg Val Gly Lys Met Val Asn Phe Ser He He Tyr Gly Val 
725 730 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly He Pro Val Lys Glu Ala 
740 745 750 



Glu Lys Met He He Ser Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
755 760 765 

Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 780 

Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 
785 790 795 800 

Lys Asn Thr Gin Ser Glu Gly Glu Arg He Ala He Asn Thr Pro He 
805 810 815 

Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Asp He Asp 
820 825 830 

Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met He He Gin Val 
835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 855 860 

Val Asp Leu Val Lys Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 
865 870 875 880 

Pro Leu Glu Val Asp He Ser He Gly Lys Ser Trp Ser 
885 890 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 893 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 
1 5 10 15 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly He Pro Thr 
20 25 30 

Asn Ala Val Tyr Asp Val Ala Arg Met Leu Val Lys Phe He Lys Glu 
35 40 45 

His He He Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lys 
50 55 60 

Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Asp Lys Ala Gin Arg 
65 70 75 80 

Pro Lys Thr Pro Ala Leu Leu Val Gin Gin Leu Pro Tyr He Lys Arg 
85 90 95 

Leu He Glu Ala Leu Gly Phe Lys Val Leu Glu Leu Glu Gly Tyr Glu 
100 105 HO 

Ala Asp Asp He He Ala Thr Leu Ala Val Arg Ala Ala Arg Phe Leu 
115 120 125 

Met Arg Phe Ser Leu He Thr Gly Asp Lys Asp Met Leu Gin Leu Val 
130 135 140 

Asn Glu Lys He Lys Val Trp Arg He Val Lys Gly He Ser Asp Leu 
145 150 155 160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 
165 170 175 

His Gin He Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp He Asp Asn 
180 185 190 

He Pro Gly Val Thr Gly He Gly Glu Lys Thr Ala Val Gin Leu Leu 
195 200 205 

Gly Lys Tyr Arg Asn Leu Glu Tyr He Leu Glu His Ala Arg Glu Leu 
210 215 220 

Pro Gin Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala He 
225 230 235 240 

Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 
245 250 255 

Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
260 265 270 

Pro He Leu Lys Glu Leu Glu Ph Ala Ser H Met Lys Glu Leu Gin 
275 280 285 
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Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu lie Val Lys Asp His 
290 295 300 

Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys Glu Val Pro Ser Phe 
305 310 315 320 

Ala Leu Ala Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu He 
325 330 335 

Val Gly He Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr He Pro 
340 345 350 

Leu His His Arg Asn Ala Gin Asn Leu Asp Glu Thr Leu Val Leu Ser 
355 360 365 



Lys Leu Lys Glu He Leu Glu Asp Pro Ser Ser Lys He Val Gly Gin 
370 375 380 

Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly He Ser Pro 
385 390 395 400 

Val Tyr Pro His Phe Asp Thr Met He Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 440 445 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Asn Tyr 
450 455 460 

Ser Cys Glu Asp Ala Asp He Thr Tyr Arg Leu Tyr Lys He Leu Ser 
465 470 475 480 

Met Lys Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg He Glu 
485 '490 495 

Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Leu Asn Gly Val Tyr 
500 505 510 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 520 525 

Leu Glu Glu Leu Ala Glu Lys He Tyr Gin lie Ala Gly Glu Pro Phe 
530 535 540 

Asn lie Asn Ser Pro Lys Gin Val Ser Lys He Leu Phe Glu Lys Leu 
545 550 555 560 



Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Glu Tyr Ser Thr 
565 570 575 
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Arg He Glu Val Leu Glu Glu He Ala Asn Glu His Glu He Val Pro 
580 585 590 

Leu He Leu Glu Tyr Arg Lys He Gin Lys Leu Lys Ser Thr Tyr He 
595 600 605 

Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg He His Ala 
610 615 620 

Ser Phe His Gin Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
625 630 635 640 

Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu He 
645 650 655 



Arg Lys Ala He Val Pro Gin Asp Pro Asp Trp Trp He Val Ser Ala 
660 665 670 

Asp Tyr Ser Gin He Glu Leu Arg He Leu Ala His Leu Ser Gly Asp 
675 680 685 

Glu Asn Leu Val Lys Ala Phe Glu Glu Gly He Asp Val His Thr Leu 
690 695 700 

Thr Ala Ser Arg He Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
705 710 715 720 

Met Arg Arg Val Gly Lys Met Val Asn Phe Ser He He Tyr Gly Val 
725 730 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly He Pro Val Lys Glu Ala 
740 745 750 



Glu Lys Met He He Ser Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
755 760 765 

Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 780 

Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin Leu Met Ala Arg Asp 
785 790 795 800 

Lys Asn Thr Gin Ser Glu Gly Glu Arg He Ala He Asn Thr Pro He 
805 810 815 

Gin Gly Thr Ala Ala Asp He He Lys Leu Ala Met He Asp He Asp 
820 825 830 

Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met He He Gin Val 
835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 855 860 
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Val Asp Leu Val Lys Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 
865 870 875 880 

Pro Leu Glu Val Asp lie Ser He Gly Lys Ser Trp Ser 
885 890 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Lys Glu Leu Gin Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu 
1 5 10 15 

He Val Lys Asp His Lys Thr Phe Glu Asp Leu He Glu Lys Leu Lys 
20 25 30 

Glu Val Pro Ser Phe Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro 
35 40 45 

Phe Asn Cys Glu He Val Gly He Ser Val Ser Phe Lys Pro Lys Thr 
50 55 60 

Ala Tyr Tyr He Pro Leu His His Arg Asn Ala Gin Asn Leu Asp Glu 
65 * 70 75 80 

Thr Leu Val Leu Ser Lys Leu Lys Glu He Leu Glu Asp Pro Ser Ser 
85 .90 95 

Lys He Val Gly Gin Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val 
100 105 HO 

Lys Gly He Ser Pro Val Tyr Pro His Phe Asp Thr Met He Ala Ala 
115 120 125 

Tyr Leu Leu Glu Pro Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser 
130 135 140 

Leu Lys Phe Leu Gly Tyr Lys Met Thr Ser Tyr Gin Glu Leu Met Ser 
145 150 * 155 1€0 

Phe Ser Ser Pro Leu Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp 
165 170 175 

Lys Ala Ala Asn Tyr Ser Cys Glu Asp Ala Asp He Thr Tyr Arg Leu 
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180 185 190 

Tyr Lye He Leu Ser Met Lys Leu His Glu Ala Glu Leu Glu Asn Val 
195 200 205 

Phe Tyr Arg He Glu Met Pro Leu Val Asn Val Leu Ala Arg Met Glu 
210 215 220 

Leu Asn Gly Val Tyr Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu 
225 230 235 240 

Glu Tyr Gly Lys Lys Leu Glu Glu Leu Ala Glu Lys He Tyr Gin He 
245 250 255 

Ala Gly Glu Pro Phe Asn He Asn Ser Pro Lys Gin Val Ser Lys He 
260 265 270 

Leu Phe Glu Lys Leu Gly He Lys Pro Arg Gly Lys Thr Thr Lys Thr 
275 280 285 

Gly Glu Tyr Ser Thr Arg He Glu Val Leu Glu Glu He Ala Asn Glu 
290 295 300 

His Glu He Val Pro Leu He Leu Glu Tyr Arg Lys He Gin Lys Leu 
305 310 315 320 

Lys Ser Thr Tyr He Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr 
325 330 335 

Gly Arg He His Ala Ser Phe His Gin Thr Gly Thr Ala Thr Gly Arg 
340 345 350 

Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn Leu Pro Thr Lys Ser Glu 
355 360 365 

Glu Gly Lys Glu He Arg Lys Ala He Val Pro Gin Asp Pro Asp Trp 
370 375 380 

Trp He Val Ser Ala Asp Tyr Ser Gin lie Glu Leu Arg He Leu Ala 
385 390 395 400 

HiB Leu Ser Gly Asp Glu Asn Leu Val Lys Ala Phe Glu Glu Gly He 
405 410 415 

Asp Val His Thr Leu Thr Ala Ser Arg He Tyr Asn Val Lys Pro Glu 
420 425 430 

Glu Val Asn Glu Glu Met Arg Arg Val Gly Lys Met Val Asn Phe Ser 
435 440 445 

He He Tyr Gly Val Thr Pro Tyr Gly Leu Ser Val Arg Leu Asn He 
450 455 460 

Pro Val Lys Glu Ala Glu Lys Met He He Ser Tyr Phe Thr Leu Tyr 
465 470 475 480 



- WO 98/35060 



PCT/US98/02791 



123 



Pro Lys Val Arg Ser Tyr He Gin Gin Val Val Ala Glu Ala Lys Glu 
485 490 495 

Lys Gly Tyr Val Arg Thr Leu Phe Gly Arg Lys Arg Asp He Pro Gin 
500 505 510 

Leu Met Ala Arg Asp Lys Asn Thr Gin Ser Glu Gly Glu Arg He Ala 
515 520 525 

He Asn Thr Pro He Gin Gly Thr Ala Ala Asp He He Lys Leu Ala 
530 535 540 

Met He Asp He Asp Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg 
545 550 555 560 



Met He He Gin Val His Asp Glu Leu Val Phe Glu Val Pro Asp Glu 
565 570 575 

Glu Lys Glu Glu Leu Val Asp Leu Val Lys Asn Lys Met Thr Asn Val 
580 585 590 

Val Lys Leu Ser Val Pro Leu Glu Val Asp He Ser He Gly Lys Ser 
595 600 605 

Trp Ser 
610 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE : amino acid 

(C) STRAND EDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE : 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 1..14 

(D) OTHER INFORMATION: /note= wl Xaa' is any amino acid" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Arg Xaa Xaa Xaa Lys Xaa Xaa Xaa Phe Xaa Xaa Xaa Tyr Xaa 
15 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Arg Arg Ser Ala Lys Ala He Asn Phe Gly Leu He Tyr Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Arg Asp Asn Ala Lys Thr Phe He Tyr Gly Phe Leu Tyr Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
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(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Arg Arg Val Gly Lys Met Val Asn Phe Ser He He Tyr Gly 
5 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

10 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 

Arg Gin Ala Ala Lys Ala He Thr Phe Gly He Leu Tyr Gly 
15 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

20 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Arg Arg Ala Gly Lys Met Val Asn Phe Ser He He Tyr Gly 
25 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Pro Ser Phe Ala Leu Asp Leu Glu Thr Ser Ser 
15 10 

5 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : not relevant 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Pro Val Phe Ala Phe Asp Thr Glu Thr Asp Ser 
15 10 

15 (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Gly Pro Val Ala Phe Asp Ser Glu Thr Ser Ala 
15 10 

25 (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met He Val Ser Asp He Glu Ala Asn Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 22: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : both 

(D) TOPOLOGY: both 

10 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GACGTTTCAA GCGCTAGGGC AAAAGA 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GTATATTATA GAGTAGTTAA CCATCTTTCC A 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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Phe Leu Phe Asp Gly Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Leu Leu Val Asp Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 10 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Ser lieu He Thr Gly Asp Lys Asp Met Leu 
1 5 -10 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
Arg He Leu Thr Ala Asp Lys Asp Leu Tyr 
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15 10 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTAGGCCAGG GCTGTGCCGG CAAAGAGAAA TAGTC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GAAGCATATC CTTGGCGCCG GTTATTATGA AAATC 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CACCAGACGG GTACCGCCAC TGGCAGGTTG 
(2) INFORMATION FOR SEQ ID NO: 31: 



30 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
5 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 
TATAGAGTAG TTAACCATCT TTCCAACCCG TTTCATTTCT TCGAACAC 
(2) INFORMATION FOR SEQ ID NO: 32: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

15 (ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

TATAGAGTAG TTAACCATCT TTCCAACCCG TTGCATTTCT TCGAACAC 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TATAGAGTAG TTAACCATCT TTCCAACCCG GTTCATTTCT TCGAACAC 48 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
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(D) TOPOLOGY: both 
(ii) MOLECULE TYPE; CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TATAGAGTAG TTAACCATCT TTCCAACCCG ATGCATTTCT TCGAACAC 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
AAGATGGTTA ACGCGTCTAT AATATACGG 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CAA6AGGCAC AGAGAGTTTC ACC 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GTATATTATA GAGGAGTTAA CCATCTTTCC 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NE S S : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 38: 
AAGATGGTTA ACTTCTCTAT AATATACGG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
TATAGAGTAG TTAACCATCT TTCCAACCCG GTACATGTCT TCGTTCAC 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
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TATAGAGTAG TTAACCATCT TTCCAACCCG CAACATGTCT TCGTTCAC 48 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
<D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CTTGGCCGCC CGATGCATCA GGGGGTC 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
CTTGGCCGCC CGCTTCATGA GGGGGTCCAC 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 43: 
CTTGGCCGCC CTGTACATCA GGGGGTC 
(2) INFORMATION FOR SEQ ID NO: 44: 



27 



WO 98/35060 



PCT/US98/02791 



134 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GTATATTATA GAGGTGTTAA CCATCTTTCC 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGGAGACCGG AATTCTCCTT CATTAATTCC TATA 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TGGAGACCCT GGAACTATAG GAATTAATGA AGGAGAATTC CGGTCTCCC 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
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(D) TOPOLOGY: both 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
GTATTTTGGT ATGCTTGTGC 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
CTATTTTGGA ATATATGTGC CT 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
ACGAACATTC TACAAGTTAC 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
TTTCAGAGAA ACTGACCTGT 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GATAAATGCC AAACATGTTG T 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
TGCTCTCAGG ATTTCCTCCA 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
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AGCTTGAGAC CTCTGTGTCC 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
10 ATTCAGAAGA AACAGTGATG GT 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
TTGGAGTCGC AAGCTGAACT AGC 
20 (2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
25 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GCCTGAGTGA CAGAGTGAGA ACC 
(2) INFORMATION FOR SEQ ID NO: 57: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
5 (D) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CCCACTAGGT TGTAAGCTCC ATGA 
(2) INFORMATION FOR SEQ ID NO: 58: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

15 (ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

TACTATGTGC CAGGCTCTGT CCTA 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

ACTCATGAAG GTGACAGTTC 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
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(D) TOPOLOGY: both 
(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
GTGTTGTTGA CCTATTGCAT 



5 (2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
10 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE : CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
ATCTCTGTTC CCTCCCTGTT 
(2) INFORMATION FOR SEQ ID NO: 62: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

20 (ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

CTTATTGGCC TTGAAGGTAG 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

AGCCCGTGTT GGAACCATGA CTG 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNE S S : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

TACATAGCGA GACTCCATCT CCC 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
20 TTTATGCGAG CGTATGGATA 

(2) INFORMATION FOR SEQ ID NO: 66:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
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CACCACCATT GATCTGGAAG 

(2) INFORMATION FOR SEQ ID NO: €7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
10 CCAACCACAC TGGGAA 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
AACAGTTGCC CACGGT 
20 (2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
25 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CATGAAATGC TGACTGGGTA 
(2) INFORMATION FOR SEQ ID NO: 70: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : both 
5 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
TCAATTTATG TGCAGCCAAT 
(2) INFORMATION FOR SEQ ID NO: 71: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

15 (ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CATAGCGAGA CTCCATCTCC 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

GGGAGAGGGC AAAGATCGAT 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 22 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
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(D) TOPOLOGY : both 
(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
AACACTAGTG ACATTATTTT CA 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
AGCTAGGCCT GAAGGCTTCT 
(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CCCTAGTGGA TGATAAGAAT AATC 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
GGACAGATGA TAAATACATA GGATGGATGG 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 
TTCTCTTACA ACACTGCCCC 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
ATTTGGATGG CTTGACAGAG 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 
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ACATTCTAAG ACTTTCCCAA T 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRAND ED NESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
10 AGAGCATGCA CCCTGAATTG 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
AAGAACCATG CGATACGACT 
20 (2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
25 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CATTCCTAGA TGGGTAAAGC 
(2) INFORMATION FOR SEQ ID NO: 83: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GCTTAGTCAT ACGAGCGG 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
TCCACAGCCA TGTAAACC 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
CCCCGGAGCA AGTTCA 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
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(D) TOPOLOGY : both 
(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
CAGCCCAAAG CCAGATTA 
<2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
ATATGTGAGT CAATTCCCCA AG 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
TGTATTAGTC AATGTTCTCC AG 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
CAGCTGCCCT AGTCAGCAC 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNE S S : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
GCTTCCGAGT GCAGGTCACA 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:91: 
ATTCTGGGCG CACAAGAGTG A 
(2) INFORMATION FOR SEQ ID NO:92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 92: 
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ACATCTCCCC TACCGCTATA 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 
GAAGTTCACC ATCCGGCCGA CCCGTCGCAT TTC 
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WHAT IS CLAIMED IS: 

1 . A method of identifying, analyzing or typing a polymorphic DNA 
fragment in a sample of DNA, said method comprising contacting said sample of 
DNA with one or more DNA polymerases substantially reduced in the ability to 

5 add one or more non-templated nucleotides to the 3' terminus of a DNA molecule, 

amplifying said polymorphic DNA fragment within said sample and analyzing said 
amplified polymorphic DNA fragment. 

2. A method of producing amplified copies of a polymorphic DNA 
fragment which comprise substantially no non-templated 3' terminal nucleotides, 

10 said method comprising contacting a DNA sample with one or more DNA 

polymerases substantially reduced in the ability to add one or more non-templated 
nucleotides to the 3* terminus of a DNA molecule and amplifying said polymorphic 
DNA fragment within said DNA sample. 

3 A method of cloning a DNA molecule comprising contacting said 
15 DNA molecule with one or more DNA polymerases substantially reduced in the 

ability to add one or more non-templated nucleotides to the 3' terminus of a DNA 
molecule, amplifying said DNA molecule and inserting said DNA molecule into 
a vector. 

4. The method of claim 3, wherein said vector is blunt-ended. 

20 5 . The method of claim 1 , wherein said polymorphic DNA fragment 

is selected from the group of polymorphic DNA fragments comprising a 
minisatellite DNA fragment, a microsatellite DNA fragment and a STR DNA 
fragment. 



25 



6. The method of claim 1 , wherein said polymerases are thermostable 
DNA polymerases. 
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7. The method of claim 6, wherein said thermostable DNA 
polymerases are Thermotoga DNA polymerases and mutants or derivatives 
thereof. 

8. The method of claim 7, wherein said DNA polymerase is a Tne or 
5 Tma DNA polymerase. 

9. The method of claim 1, wherein said DNA polymerases are 
substantially reduced in 3-5 1 exonuclease activity. 

10. The method of claim 1, wherein said DNA polymerases are 
substantially reduced in 5-3' exonuclease activity. 

10 11. The method of claim 9, wherein said DNA polymerases are 

substantially reduced in 5-3' exonuclease activity. 

12. The method of claim 1, wherein said DNA polymerases contain one 
or more modifications or mutations which reduce the ability of the polymerase to 
add one or more non-templated 3' nucleotides to a synthesized nucleic acid 

15 molecule. 

13. The method of claim 12, wherein said DNA polymerases are 
substantially reduced in at least one activity selected from the group consisting of: 

(a) 3-5' exonuclease activity; and 

(b) 5-3' exonuclease activity. 

20 14. The method of claim 13, wherein said polymerases have 

substantially reduced 3-5 1 exonuclease and 5'-3' exonuclease activity. 

1 5 . The method of claim 13, wherein said polymerase is substantially 
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reduced in 3-5' exonuclease activity. 

16. The method of claim 12, wherein said polymerases comprise one 
or more mutations or modifications in the O-helix of said polymerase. 

17. The method of claim 16, wherein said O-helix is defined as 
IODDOCXXXFXXXYX (SEQ ID NO:l 1), wherein X is any amino acid. 

18. The method of claim 17, wherein said mutation or modification is 
at position R (Arg) and/or F (Phe) and/or K (Lys) of said O-helix or combinations 
thereof 

1 9. The method of claim 1 6, wherein said mutation or modification is 
an amino acid substitution at position R and/or F and/or K of said O-helix or 
combinations thereof. 

20. The method of claim 1, wherein said polymerase is selected from 
the group consisting of: 

Tne WA219, D323A; 
7heN l A283 1 D323A; 
7heN'A284, D323A; 
7teN , A193,D323A; 
7weD137A, D323A; 
Tne D8A, D323A; 
Tne G195D, D323A; 
Tne G37D, D323A; 
7weN'A283; 

Tne D137A, D323A, R722K; 
Tne D137 A, D323A, R722Y; 
Tne D137A, D323A, R722L; 
Tne D137A, D323A, R722H; 
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Tne D137A, D323A, R722Q; 
Tne D137A, D323A, F730Y; 
Tne D137A, D323A, K726R; 
Tne D137A, D323A, K726H; 
Tne DMA, D323A, R722K, F730Y; 
Tne D137A, D323A, R722K, K726R 
Tne D137A, D323A, R722K, K726H; 
Tne D137A, D323A, R722H, F730Y; 
Tne D137A, D323A, R722H, K726R; 

Tne D137A, D323A, R722H, K726H; 

Tne D137A, D323A, R722Q, F730Y; 

Tne D137A, D323A, R722Q, K726R 

Tne D137A, D323A, R722Q, K726H; 

Tne D137A, D323A, R722N, F730Y; 

Tne D137A, D323A, R722N, K726R; 

Tne D137A, D323A, R722N, K726H; 

Tne D137A, D323A, F730S; 

Tne N'A283, D323A, R722K/H/Q/N/Y/L; 

Tne N*A219, D323A, R722K; 

Tne N*A219, D323A, F730Y; 

Tne 1SPA219, D323A, K726R; 

Tne WA219, D323A, K726H; 

Tne D137A, D323A, F730S, R722K7Y/Q/N/H/L, K726R/H; 

7«e D137A, D323A, F730T, R722K/Y/Q/N/H/L, K726R/H; 

7>»eD137A,D323A,F730T; 

Tne F730S; 

7he F730A; 

7he K726R; 

7WeK726H;and 

7«e D137A, D323A, R722N. 
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21. A method of determining the relationship between a first individual 
and a second individual, said method comprising comparing a population of 
amplified DNA molecules in a sample of DNA from said first individual to that of 
said second individual prepared according to the method of claim 1 . 

22. The method of claim 21, wherein said sample of DNA from said 
first individual is a known sample and said sample of DNA from said second 
individual is an unknown sample. 

23. A kit for the identification, analysis or typing of a polymorphic 
DNA fragment, said kit comprising one or more DNA polymerases substantially 
reduced in the ability to add one or more non-templated nucleotides to the 3' 
terminus of a DNA molecule. 

24. The kit of claim 23, said kit further comprising one or more 
components selected from the group consisting of one or more DNA primers, one 
or more deoxynucleoside triphosphates, and a buffer suitable for use in the 
identification, analysis or typing of a polymorphic DNA fragment. 

25. The kit of claim 23, wherein said polymerases are thermostable 
DNA polymerases. 

26. The kit of claim 25, wherein thermostable DNA polymerases are 
Thermotoga DNA polymerases. 

27. The kit of claim 23, wherein said DNA polymerase is substantially 
reduced in 3'-5' exonuclease activity. 

28. The kit of claim 23, wherein said DNA polymerase is substantially 
reduced in 5-3' exonuclease activity. 
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29. The kit of claim 23, wherein said DN A polymerases comprise one 
or more modifications or mutations which reduce the ability of the polymerase to 
add one or more non-templated 3' nucleotides to a synthesized nucleic acid 
molecule. 

30. The kit of claim 29, wherein said polymerases comprise one or 
more mutations in the O-helix of said polymerase. 

31. The kit of claim 30, wherein said O-helix is defined as 
RXXXKXXXFXXXYX (SEQ ID NO: 1 1), wherein X is any amino acid. 

32. The kit of claim 31, wherein said mutation or modification is at 
position R (Arg) and/or F (Phe) and/or K (Lys) of said O-helix or combinations 
thereof 

33 . The method of claim 3 1 , wherein said mutation or modification is 
an amino acid substitution at position R and/or F and/or K of said O-helix or 
combinations thereof. 

34. A polymerase which has been modified or mutated to reduce, 
substantially reduce or eliminate the ability of the polymerase to add non- 
templated 3* nucleotides to a synthesized nucleic acid molecule. 

35. The polymerase of claim 34, wherein said polymerase is a DNA or 
RNA polymerase. 

36. The polymerase of claim 34, wherein said polymerase is 
substantially pure. 

37. The polymerase of claim 34, wherein said polymerase is mesophilic 
or thermostable. 
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38. The polymerase of claim 34, wherein said polymerase is selected 
from the group consisting of The DNA polymerase, Taq DNA polymerase, Tma 
DNA polymerase, Tth DNA polymerase, 77/ DNA polymerase, VENT™ DNA 
polymerase, Pfu DNA polymerase, DEEPVENT™ DNA polymerase, Pwo DNA 

5 polymerase, Bst DNA polymerase, Bca DNA polymerase, Tjl DNA polymerase, 

and mutants, variants and derivatives thereof. 

39. The polymerase of claim 34, wherein said polymerase is 
substantially reduced in at least one activity selected from the group consisting of: 

(a) 3'-+ 5* exonuclease activity; and 
10 (b) 5'->y exonuclease activity. 

40. The polymerase of claim 39, wherein said polymerase is 
substantially reduced in 3-5' exonuclease activity. 

41. The polymerase of claim 39, wherein said polymerase is 
substantially reduced in 5'-3 ! exonuclease activity. 

15 42. The polymerase of claim 41, which is modified or mutated to 

reduce or eliminate 3'-5 ! exonuclease activity. 

43. The polymerase of claim 34, which comprises one or more 
modifications or mutations in the O-helix of said polymerase. 

44. The polymerase of claim 43, wherein said O-helix is defined as 
20 RXXXKXXXFXXXYX (SEQ ID NO : 1 1 ), wherein X is any amino acid. 

45. The polymerase of claim 44, wherein said mutation or modification 
is at position R (Arg) and/or F (Phe) and/or K (Lys) of said O-helix or 
combinations thereof. 
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46. The polymerase of claim 44, wherein said mutation or modification 
is an amino acid substitution at position R and/or F and/or K of said O-helix or 
combinations thereof. 

47. The polymerase of claim 46, wherein R (Arg) is substituted with 
an amino acid selected from the group consisting of Ala, Asn, Asp, Cys, Gin, Glu, 
Gly, His, He, Leu, Lys, Met, Phe, Pro, Ser, Thr, Trp, Try and Val. 

48. The polymerase of claim 46, wherein R (Arg) is substituted with 
Lys or His. 

49. The polymerase of claim 46, wherein F (Phe) is substituted with an 
amino acid selected from the group consisting of Ala, Asn, Arg, Asp, Cys, Gin, 
Glu, Gly, His, lie, Leu, Lys, Met, Pro, Ser, Thr, Trp, Try and Val. 

50. The polymerase of claim 46, wherein K (Lys) is substituted with 
an amino acid selected from the group consisting of Ala, Arg, Asn, Asp, Cys, Gin, 
Glu, Gly, His, lie, Leu, Met, Phe, Pro, Ser, Thr, Trp, Try and Val. 

51. The polymerase of claim 46, wherein K (Lys) is substituted with 
Arg or His. 

52. A mutant Tne DNA polymerase protein selected from the group 
consisting of: 

TneWto\9, D323A; 
7wN'A283,D323A; 
TneWL2%4, D323A; 
7>ieN , Al93,D323A; 
7heD137A, D323A; 
7heD8A, D323A; 
Tne G195D, D323A; 
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Tne G37D, D323A; 
7weN'A283; 

Tne D137A, D323A, R722K; 

Tne D137A, D323A, R722Y; 

Tne D137A, D323A, R722L; 

Tne D137A, D323A, R722H; 

Tne D137A, D323A, R722Q; 

Tne D137A, D323A, F730Y; 

Tne D137A, D323A, K726R; 

Tne D137A, D323A, K726H; 

Tne D137A, D323A, R722K, F730Y; 

Tne D137A, D323A, R722K, K726R; 

Tne D137A, D323A, R722K, K726H; 

Tne D137A, D323A, R722H, F730Y; 

Tne D137A, D323A, R722H, K726R; 

Tne D137A, D323A, R722H, K726H; 

Tne D137A, D323A, R722Q, F730Y; 

Tne D137A, D323A, R722Q, K726R; 

Tne D137A, D323A, R722Q, K726H; 

Tne D137A, D323A, R722N, F730Y; 

Tne D137A, D323A, R722N, K726R 

Tne D137A, D323A, R722N, K726H; 

Tne D137A, D323A, F730S; 

Tne N*A283, D323A, R722K/H/Q/N/Y/L; 

Tne N-A219, D323A, R722K; 

7>ieN*A219,D323A,F730Y; 

Tne N'A219, D323A, K726R; 

Tne N'A219, D323A, K726H; 

Tne D137A, D323A, F730S, R722K/Y/Q/N/H/L, K726R/H; 
Tne D137A, D323A, F730T, R722K/Y/Q/N/H/L, K726R/H; 
Tne D137A, D323A, F730T; 
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7*eF730S; 

rneF730A; 

7weK726R; 

Tne K726H; and 

Tne D137A, D323A, R722N. 

53 . A vector comprising a gene encoding the polymerase of claim 34, 

54. The vector of claim 53, wherein said gene is operably linked to a 
promoter. 

55. The vector of claim 54, wherein said promoter is selected from the 
group consisting of a X-P L promoter, a tac promoter, a trp promoter, and a trc 



56. ' A host cell comprising the vector of claim 53 . 

57. A method of producing a polymerase, said method comprising: 
(a) culturing the host cell of claim 56; 



58 . A method of synthesizing a nucleic acid molecule comprising: 

(a) mixing a nucleic acid template with one or more 
polymerases of claim 34; and 



a nucleic acid molecule complementary to all or a portion of said template. 

59. The method of claim 58, wherein said mixture further comprises 
one or more nucleotides selected from the group consisting of dATP, dCTP, 
dGTP, dTTP, dITP, 7-deaza-dGTP, dUTP, ddATP, ddCTP, ddGTP, ddlTP, 



promoter. 



15 



(b) expressing said gene; and 

(c) isolating said polymerase from said host cell. 



20 
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ddTTP, [<x-S]dATP, [a-S]dTTP, [a-S]dGTP, and [a-S]dCTP. 

60. The method of claim 59, wherein one or more of said nucleotides 
are detectably labeled. 

61. A method of sequencing a DN A molecule, comprising: 
5 (a) hybridizing a primer to a first DNA molecule; 

(b) contacting said DNA molecule of step (a) with 
deoxyribonucleoside triphosphates, one or more DNA polymerases of claim 34, 
and a terminator nucleotide; 

(c) incubating the mixture of step (b) under conditions 
10 sufficient to synthesize a random population of DNA molecules complementary 

to said first DNA molecule, wherein said synthesized DNA molecules are shorter 
in length than said first DNA molecule and wherein said synthesized DNA 
molecules comprise a terminator nucleotide at their 3' termini; and 

(d) separating said synthesized DNA molecules by size so that 
15 at least a part of the nucleotide sequence of said first DNA molecule can be 

determined. 

62. The method of claim 61, wherein said deoxyribonucleoside 
triphosphates are selected from the group consisting of dATP, dCTP, dGTP, 
dTTP, dITP, 7-deaza-dGTP, dUTP, [a-S]dATP, [a-S]dTTP, [a-SJdGTP, and 

20 [ct-S]dCTP. 

63. The method of claim 61, wherein said terminator nucleotide is 
ddTTP, ddATP, ddGTP, ddlTP or ddCTP. 

64. The method of claim 61, wherein one or more of said 
deoxyribonucleoside triphosphates is detectably labeled. 



25 



65. 



The method of claim 61, wherein one or more of said terminator 
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nucleotides is detectably labeled. 

66. A method for amplifying a double stranded DNA molecule, 
comprising: 

(a) providing a first and second primer, wherein said first 
5 primer is complementary to a sequence at or near the 3'-termini of the first strand 

of said DNA molecule and said second primer is complementary to a sequence at 
or near the 3'-termini of the second strand of said DNA molecule; 

(b) hybridizing said first prima- to said first strand and said 
second primer to said second strand in the presence of the one or more DNA 

10 polymerases of claims 34, under conditions such that a third DNA molecule 

complementary to said first strand and a fourth DNA molecule complementary to 
said second strand are synthesized; 

(c) denaturing said first and third strand, and said second and 

fourth strands; and 

15 (d) repeating steps (a) to (c) one or more times. 

67. A kit for sequencing, amplifying or sequencing a DNA molecule 
comprising one or more polymerases of claim 34. 

68. The kit of claim 67, further comprising one or more 
dideoxyribonucleoside triphosphates and/or one or more deoxyribonucleoside 

20 triphosphates. 



WO 98/35060 



PCT/DS98/02791 



1/16 



Of 





-WO 98/35060 



PCTAJS98/02791 



2/16 




FIG.iA 



-WO 98/35060 



PCT/US98/02791 



3/16 




FIG.i-B 



WO 98/35060 PCT/US98/02791 




WO 98/35060 



PCT/US98/02791 



5/16 



Sstl 




pTTQ19 



Sphl+Snftl 



Kndltl 
I DIGEST VITH HindHI 

z rn.L-!N 

3. DIGEST VITH Spht 

4 ISOLATE Trie pol-FRAGHENT 



toe 



BonHl 
Hpal 




1. Sphl*Spel DIGESTION 

2. ISOLATE YECTOR PLUS SMALL 
REHAINIG Tne Pol FRAGMENT 



'BonHl 
ttndlll 
LSphl+Spel DIGESTION 

2. Isolate Tne w fragment 




BonHl 

V #^HpoI 
\. BonHl 




Fl<5- 5T 



WO 98/35060 



PCT/US98/02791 



7/16 

C 1 2 3 4 5 6 7 8 9 10111213141516171819 
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C Control 
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Applicant's or agent's file 
reference number 0942.425PC02 



International application No. TO BE ASSIGNED 



JCFAJS 98 '027 9*1 

INDICATIONS RELATING TO A DEPOSITED MICROORGANISM w - ' r* 

(PCTRuleBta) 



A. The indications made below relate to the microorganism referred to in the description on pagd£?_ linrU, 



.— ' — ^ 



B. IDENTIFICATION OF DEPOSIT Further deposils m identified on an additional sheet E) 



Name of depositary institution: Agricultural Research Service Culture Collection (NRRL) 



Address of depositary institution (including postal code and country) 

1815 North University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit: 30 September 1994 



Accession Number: NRRLB-21238 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □ 



Kcoli/DH10B(pVC'Tne) 

In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC). _^_ 9= ^ S= _ =HH 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS d*™ tumk if*** appikabuj 



The indications listed below will be submitted to the international Bureau later (specify the general nature of the indications, e.g., 
"Accession Number of Deposit") 







Nj^This sheet was received with the international application 


□ This sheet was received by the International Bureau on: 


Authorized officer 


Authorized officer 



Form PCT/RO/134 (July 1992) 



425PC2.EPD 



(E co/i (pUC-Tne)) 
AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis of 
the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months 
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 





International application No. T E ASSIGNED 




Applicant's or agenfs file 

reference number 0942.425PC02 


PC1WS 98/Q27 9ft1 





INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulel3Wy) 



A. The indications made below relate to the microorganism referred to in the description on page ff8, line 2. 



B. IDENTIFICATION OF DEPOSIT Furthcr deposits are identified on an additional sheet. □ 



Name of depositary institution Agricultural Research Service Culture Collection (NRRL) 



Address of depositary institution (including postal code and country) 



1815 North University Street 




Peoria, Illinois 61604 




United States of America 




Date of deposit 30 September 1994 


Accession Number NRRL B-21338 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet □ 



E.coli/DHI0B(pVC\9-Tne) 



In respect of those designations in which a European Patent is sought a sample of the deposited microorganism will be made 
available until the publication of the mention of the grant of the European patent or until the date on which the application has been 
refused or withdrawn or is deemed to be withdrawn, only by the issue of such a sample to an expert nominated by the person 
requesting the sample (Rule 28(4) EPC) 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States} 



E. SEPARATE FURNISHING OF INDICATIONS (tea* blank If not applicable) 

The indications listed below will be submitted to the international Bureau later (specify the general nature of the indications, e.g., 
"Accession Number of Deposit") 



For receiving Office use only 



\ For receiving urtic 



For International Bureau use only 



FJDUS 98/027 9QH 



(E. coliDHlOB (pUC-Tne)) 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 31F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, .the furnishing of a sample shall only be 
effected to an expert in the art The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such 
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent office or any person approved by the applicant 
in the individual case. 

UNITED KINGDOM 

The applicant hereby requests .that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



PCTJS 98/02790-1 



(E. coliDHlOB (pVC19-Tne)) 
AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

CANADA 

The applicant hereby requests that, until either a Canadian patent has been issued on the basis of 
the application or the application has been refused, or is abandoned and no longer subject to 
reinstatement, or is withdrawn, the furnishing of a sample of deposited biological material 
referred to in the application only be effected to an independent expert nominated by the 
Commissioner of Patents. 

DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed 
by the applicant, any request made by a third party for the furnishing of a sample shall indicate 
the expert to be used. That expert may be any person entered on a list of recognized experts 
drawn up by the Danish Patent office or any person approved by the applicant in the individual 
case. 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art The request to this effect 
shall be filed by the applicant with the International Bureau before the expiration of 16 months 
from the priority date (preferably on the Form PCT/RO/134 reproduced in annex Z of Volume 
I of the PCT Applicant's Guide). If such a request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the National Board 
of Patents and Registration or any person approved by the applicant in the individual case. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. 



(E. coliDHlOB (pUC19-Tne)) 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapsed, the microorganism shall be 
made available as provided in Rule 3 1F(1) of the Patent Rules only by the issue of a sample to 
an expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the public 
under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever of the two dates occurs earlier. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegian Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent office or any person approved by the applicant in the 
individual case. 

SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, .the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PCT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such 
a request has been filed by the applicant, any request made by a third party for the furnishing of 
a sample shall indicate the expert to be used. That expert may be any person entered on a list of 
recognized experts drawn up by the Swedish Patent office or any person approved by the applicant 
in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US9o70279| 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) :C12Q 1/68; C12P 19/34; C12N 9/12, 15/00, 15/63, 15/85 
US CL : 435/6, 69.1, 91.2. 194, 325, 320.1 
According to International Patent C lass ifi cation (IPC ) or to both national classification and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

U.S. : 435/6. 69.1, 91.2, 194. 325, 320.1 
Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
Searched inventors and keywords: thermotoga and polymerase with modified exonuclease or neapolitana in APS, CAPLUS. 
MEDLINE, SCISEARHC, WPIDS. 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


WO 96/10640 Al (LIFE TECHNOLOGIES, INC.) 11 April 1996, 
see pages 4-5 and pages 40-51. 


1-19,21-51,53-68 


X 


WO 96/41014 Al (PROMEGA CORPORATION) 19 December 
1996, see pages 10-17 and 185-191. 


1-19,21-51, 53-68 


X 
Y 


US 5,420,029 A (GELFAND et al) 30 May 1995, see col. 2-3 and 
col. 73-76. 


1-15,23-29,34- 
42,53-57,66-68 




21-22,58-64 


A 


US 5,466,591 A (ABRAMSON et al) 14 November 1995, see entire 
document. 


1-68 


A 


US 5,541,099 A (CHATTERJEE) 30 July 1996, see entire 
document. 


1-68 



"x| Further documents are listed in the continuation of Box C. Q See patent family annex. 



Special categories of cited documents: 

document defining the general state or the art which u not considered 
to be of particular relevance 



"B* 



•X" 



earlier document published oa or after the international filing data 

document which nay throw doubts on priority clsun(t) or which is 
cited to establish the publication data of another citation or other 
special roesoa (as specified) 

document referring to an oral disclosure, use. exhibition or other 



•P* document published prior to the tnternationsl filing date but later than » & - 



later document published after the international filing date or priority 
date end not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

document of particular relevance; the claimed invention 
considered ■ to involve an inventive step when the do 
combined with one or more other such documents, such cotnl 
being obvious to a person skilled in the art 

document member of the same patent family 



Date of the actual completion of the international search 
13 APRIL 1998 


Date of mailing of the^interaational search report 

ftto MAY 199H. 


Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCX 

Washingtoiw D.C. 20231 
Facsimile No. (703) 305-3230 


Authorized oj 
TalefDoneN 


j. (703) 308-1096 Q 



Form PCT/ISA/210 (second sheetXJuly 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US98/D2791 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


A 


US 5,489,523 A (MATHUR) 06 February 1996, see entire 
document. 


1-68 



Form PCT/ISA/210 (continuation of second shectXJuty 1992)* 



